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Preface 


Electromagnetic theory is beautiful! When looked at from the relativistic 
point of view where electric and magnetic fields are really different aspects 
of the same physical quantity, it exhibits an aesthetically pleasing structure 
which has served as a model for much of modern theoretical physics. Un- 
fortunately this beauty has been all but buried as most textbooks have 
treated electricity, magnetism, Coulomb’s law, and Faraday’s law as almost 
completely independent subjects with the ground work always supplied by 
means of empirical or historical example. Occasionally a chapter is devoted 
to the relativistic coalescence of the various aspects of electromagnetism 
but use is rarely made of the requirement of Lorentz invariance in deriving 
the fundamental laws. 

Our point of view here is quite different. Basically we have two purposes 
in mind—one is to exhibit the essential unity of electromagnetism in its 
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natural, relativistic framework and the other is to show how powerful the 
constraint of relativistic invariance is. To these ends we shall show that all 
electromagnetism follows from electrostatics and the requirement that our 
laws be the simplest ones allowable under the relativistic constraint. The 
hope is that the student will make use of these new insights in thinking about 
theories that are as yet undeveloped and that the model we set here will be 
generally useful in other areas of physics. 

A word about units. Unfortunately one of the results of the completely 
disconnected way in which electricity and magnetism have been taught in 
the past has been the growing acceptance of the mks over the cgs system of 
units. We have no special preference for centimeters over meters or of 
grams over kilograms. We do, however, require a system wherein the 
electric field E and the magnetic field B are in the same units. Using the mks 
system, as it is presently constituted, for electromagnetic theory is akin to 
using a meterstick to measure along an East-West line and a yardstick to 
measure along a North-South line. To measure E and B in different units 
is completely antithetical to the entire notion of relativistic invariance. 
Accordingly we will make use of the cgs (gaussian) system of units ex- 
clusively. Conversion to practical units where necessary can be carried 
out with no difficulty. 

The author would like to express his most profound appreciation 
to Miss Margaret Hazzard for her patient and careful typing of the text. 


MELVIN SCHWARTZ 


1 
Mathematical Review 
and Survey of Some New 
Mathematical Ideas 


It would be delightful if we could start right out doing physics without the 
need for a mathematical introduction. Unfortunately though, this would 
make much of our work immeasurably more laborious. Mathematics is 
much more than a language for dealing with the physical world. It is a 
source of models and abstractions which will enable us to obtain amazing 
new insights into the way in which nature operates. Indeed, the beauty 
and elegance of the physical laws themselves are only apparent when 
expressed in the appropriate mathematical framework. 

1 


2 MATHEMATICAL REVIEW 


We shall try to cover a fair bit of the mathematics we will need in this 
introductory chapter. Several subjects are, however, best treated within 
the context of our physical development and will be covered later. It is 
assumed that the reader has a working familiarity with elementary calculus, 
three-dimensional vectors, and the complex number system. All other 
subjects will be developed as we go along. 


1-1 VECTORS IN THREE DIMENSIONS; 
A REVIEW OF ELEMENTARY NOTIONS 


We begin by reviewing what we have already learned about three-dimensional 
vectors. As we remember from our elementary physics, there are a large 
number of quantities that need three components for their specification. 
Position is, of course, the simplest of these quantities. Others include 
velocity and acceleration. Even though we rarely defined what was meant 
by a vector in mathematically rigorous terms, we were able to develop a 
certain fluency in dealing with them. For example, we learned to add two 
vectors by adding their components. That is, if r; = (x,,y;,2Z,) andr, = 
(x2,¥2,Z2) are two vectors, then 


Ey 0p = (41 FE X2, Vy +.¥a,.21 + Za) 
If a is a number, then 
ar, = (aXx,,ay,,aZ,) 


We also found it convenient to represent a vector by means of an arrow 
whose magnitude was equal to the vector magnitude and whose direction 
was the vector direction. Doing this permitted us to add two vectors by 
placing the “tail” of one at the “head” of the other as in Fig. 1-1. We also 
learned how to obtain a so-called scalar quantity by carrying out a type 
of multiplication with two vectors. If ry = (x;,)1,2;) and r, = (x2,V2,2Z2) 
are two vectors, then r, - rz is defined by the equation 


Py°l. = XX. + Wyy2 + 2422 


It was also shown that r, - r, could be obtained by evaluating |r, | |r| cos 0,2, 
where |r,| and |r,| are, respectively, the magnitudes of r, and r, and 0, 


Fig. 1-1 The addition of two vectors can 
be accomplished by placing the “tail” of 
one at the “head” of the other. 
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is the angle between them. Another so-called vector was obtained by taking 
the cross product of r, and r,. That is, 


Ky XW. = (122 — 223, 24X2 — 22X15 X1 V2 — YX) 


We shall have much more to say about the true nature of this beast very 
shortly. At the moment we just recall that it appears in some respects to 
be a vector whose magnitude is equal to |r,| |r,| sin @,, and whose direction, 
at right angles to both r, and r,, is given by a so-called right-hand rule in 
going from r, to r,. If we look from the head toward the tail of r, x ry, 
we would see the shortest rotation from r, to r, to be in the counterclockwise 
direction. 

Unfortunately, we shall have to relearn much of the above within a 
more abstract framework if we are to make any progress beyond this 
point. We shall have to go back to our basic notions and see if we can 
define what we mean by vector in a more suitable, less intuitive manner. 
Only by doing so will we be prepared to say clearly which combinations of 
three numbers are vectors and which are not. We will also be able to define 
scalar in a reasonable way and will then see our way clear to an under- 
standing of higher-rank tensors. 


1-2. THE TRANSFORMATION PROPERTIES OF VECTORS 
UNDER SPATIAL ROTATION 


To open the way for a more rigorous definition of vector, we proceed a bit 
further with our old intuitive notions. Let us consider a so-called position 
vector, that is, a vector from the origin of our coordinate system to the point 
(x,y,z). If we draw a unit vector along each of the three axes as shown in 
Fig. 1-2 and call them i, j, and k, respectively, we can writer = xi + yj + zk. 
Now, we ask, what if we were to rotate our coordinate system to a new 
set of axes x’, y’, and z’ with a new set of unit vectors a J, and k’? How 
would r be expressed now? We answer this question very simply by expressing 
i, j, and k in terms of the new unit vectors i’, J’, and k’. (This is possible 


(x, y,2) 


Fig. 1-2 The vector r can be expressed as 
r = xi + yj + zk where i, j, and k are 
unit vectors along the x, y, z axes. 


4 MATHEMATICAL REVIEW 


because any three-dimensional vector whatsoever can be expressed either 
in terms of i, j, and k or in terms of i’, j’, and k’.) We write 


= 44,0 + Qg,J + a3,k 


tonto) 
| 


fmt e) 
li 


Ay2i + ago + azok’ @e2=1) 
kK = ay3i + a23)' + aggk’ 
We note the obvious fact that 
i-i = A; i-j =a, i-k = G3, 
j-i ati j-j = a2, j-k = as, 
k-i = ay; k-j’ = a3 k-k’ = a3; 
This, of course, permits us immediately to express the unit vectors i’, j’, and 
k’ in terms of i, j, and k, viz., 
¥V = ayyi + ay.) + ay3k 


j = Hal + Aj ar oak (1-2-2) 


Lk 
I 


a3; PF a3) AF a33k 


We realize that not all the nine quantities a;; can be chosen indepen- 
dently. After all, only three angles are necessary to specify the rotation 
of one coordinate system into another. We expect then to have six equations 
linking the coefficients. We obtain these equations by requiring that 1’, j’, 
and k’ form an orthogonal set of unit vectors. 


i oe | — 1 oN + ay? + a3” 
— — 2 2: 2 
J = 1 =a," + 422° + a3 
k’ - k’ = 1 = aay? ap aay + aay 


(1-2-3) 


= 41421 + Qy2A27 + Ay3423 


a 


0 

, Fe 

JK = 0 = 42,43, + @22432 + a23433 
kK) =o 


= 3143; + Ay2032 + A130433 


Now to return to our original vector r. We can write r in terms of its compo- 
nents in either of two ways: 


r=xi+ yj + zk or r=xT + yj + 7k 
Making use of Eqs. (1-2-1), we find immediately that 


X = Ay,;X + Ayoy + A432 
Q21X + Ag2y + A732 (1-2-4) 
Z = 43,X + G32 + A332 


<= 
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We have traditionally used a right-handed coordinate system to 
specify the components of a vector. That is to say we have chosen i, j, and 
Kk so that if we curl the fingers of our right hand from i to j, our thumb 
will point along k. Expressing this in language somewhat more abstract 
and less anthropomorphic, we can say that i x } = k in such a system. 
Obviously there is nothing in nature that requires us to limit ourselves to 
right-handed coordinate systems, and we might ask if there is anything 
special about the set of numbers a,; if the primed system should happen 
to be a left-handed system. For a left-handed system we can write 


(iv x j)-k’ = -1 (es) 


Expressing i’, j’, and k’ in terms of i, j, and k, we can rewrite this equation 
as follows: 


[(auri + ayoj + ay3k) x (az,1 + a.j + a23k)] 
*(a3,1 + a39) + a33k) = — 
Carrying out the indicated multiplications, we find 
Ay 1(472433 — 423432) + 4,2(42343, — 421433) 
+ 13(421432 — 42243;) = —1 (1-2-6) 


The expression on the left of Eq. (1-2-6) is called the determinant of the 
matrix of numbers a;; or det a,; for short. It is often written in the notation 


411 @y2 43 
det a3; = ar, az2 ar3 


43; 432 433 


We see then that any transformation that takes us from a right-handed 
coordinate system to a left-handed coordinate system is characterized by 
having its determinant equal to —1. Indeed, as we can easily see, the deter- 
minant is equal to —1 whenever we change the handedness of our system 
and +1 if we keep it unchanged. By allowing transformations with either 
sign of determinant, we allow ourselves to deal with both rotations and 
reflections or with any combination of these transformations. 

We have begun to think of our transformation as having an “‘identity”’ 
all its own. It is characterized by a set of nine numbers, which we have 
called a matrix. Furthermore we have seen in Eq. (1-2-4) that we can obtain 
the triplet (x’,y’,z’) by “multiplying” the triplet (x,y,z) by this matrix, 
with the operation of multiplication being defined as 


f 


x G1, Qy2 ayy x Ay3X + Ay2zy + A432 
VY} =] Gar 22 23 Y | =] GarX + Qg2y + Ap32 (1-2-7) 


/ 


Zz 43; 432 433 Zz 31X + Az2y + 332 
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We can represent the above operation symbolically by writing 
ia="ar (1-2-8) 


(In the future, a boldface sans serif symbol, such as a, will mean that the 
symbol is a matrix and not a number.) 

Suppose now that we wish to undertake two successive transforma- 
tions, the first characterized by a and the second by another matrix b. 
If we begin with the triplet r, then the first transformation leads to the 
triplet r’ and the second to the triplet r’. That is, 


r = ar 
r’ ==: br’ 


Alternatively, we might have gone directly from the unprimed to the double- 
primed coordinate system by means of a transformation c. 


r’ =cr 


Writing out these transformations in detail will show that we could determine 
all the elements of c directly from a and b by means of the simple set of 
equations 


Cry = byyQyy + by 2421 + by 303; 
Cra = 51 G12 + by2422 + 513432 


or, in general, 
Cig = bi; + i242; + ;343; 


We abbreviate this in the customary way by writing 


3 
CS »> bi A; (1-2-9) 
Thus the element c;; can be obtained by taking the “scalar product,” so to 
speak, of the ith row in b with the jth column in a. 

The operation which we have defined above in Eq. (1-2-9) is called the 
product of two matrices a and b and can be represented by the expression 
c = ba. Matrix multiplication, unlike the multiplication of two numbers, 
is not in general commutative, as the reader can very easily convince himself. 
That is to say the product ab is not in general equal to the product ba. 
Multiplication is, however, associative. This means that we can in general 
write, for three transformations a, b, and c, 


a(be) = (ab)c (1-2-10) 


To complete our picture we should point out that one of the possible 
transformations is the identity transformation which leaves the coordinate 
system unchanged. We write this matrix as 1 with the observation that 
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100 
7 = nO. 0 (ioe) 
001 


Returning back to Eqs. (1-2-1) and (1-2-2), we see that for every transforma- 
tion a there is also an inverse transformation a~‘ such that 
=a ‘a=1 (lezet2) 


aa! 
The inverse transformation is just given by the transposed matrix. That is 
to say 


a (1-2-13) 
(For those whose mathematical sophistication is just a bit above average, 
we might point out that the set of all transformations defined above con- 
stitute what is known in the trade as a group. The detailed properties of 
groups play an important role in the development of much of quantum 
mechanics and should be studied at the earliest possible moment by those 
who intend to extend their horizons in physics beyond the classical domain.) 

We can now think in terms of the complete set of all transformations 
from one orthogonal coordinate system to another, including within our 
set both rotations (det a = +1)and reflections (det a = — 1). The definition 
of scalar, vector, and various other entities is now best done in terms of this 
set of transformations. 

Let us begin with what is intuitively the simplest of these entities, the 
scalar. Imagine that we are given a set of explicit instructions for determining 
some number. We follow these instructions scrupulously, coming up with a 
value for the number. We can now rotate our coordinate system or change 
its handedness (by means of the transformation a). If the same set of rules 
for determining the number leads to the same result in the new system, 
regardless of the choice of rotation or reflection, then the number is a scalar. 

Obviously there are innumerable trivial examples of scalars that we 
can readily cite. The number of cents in the dollar or the number of fingers 
on your hand have nothing to do with the coordinate system and hence are 
ipso facto scalars. Much less trivial, though, are numbers that are derived 
by means of rules which concern coordinates themselves. Let us take a 
simple example. 

Suppose the rule tells us to take the x coordinate of a point, square it, 
add to that the square of the y coordinate of the same point, and add to 
the sum the square of the z coordinate of the point. We would have then a 
number equal to x? + y? + z’. If we transform to a new system and follow 
the same prescription in the new system, we come up with x’? + y? 4+ 2’. 
Unless we knew the Pythagorean theorem we would have no a priori 


a 


8 MATHEMATICAL REVIEW 


expectation that the same rule applied in these different systems would give 
us the same result. Indeed it does because we have just determined the 
square of the distance from our point to the origin, and that quantity does 
not depend on the noe sical or the handedness of our system. 
Clearly then the number x? + y? + z? is a scalar. 

Let us try a more difficult example now. Consider two points whose 
coordinates in one system are (x,,y,,Z,) and (x2,y2,Z2). We can form the 
expression x, xX. + y,;y2 + 2,22 and evaluate it in this coordinate system. 
We can now transform coordinates and evaluate the same expression in 
the new system, obtaining x}x> + y;y2 + 2,2. Again we have no a priori 
expectation that the two numbers will come out to be the same. Making 
use of Eqs. (1-2-4) and (1-2-3), the reader can easily convince himself that 
this is, however, the case—the numbers are the same and so the expression 
X1X_. + Vi v2 + 2,22 is a scalar. (The result is not entirely unanticipated 
for we remember that this expression is the scalar product of r,; and r, 
and can also be written as |r,||r,|cos 6. The latter formula does not depend 
on the coordinate system.) 

There is a great temptation now to let every “constant’’ of nature, 
like charge and mass, be labeled a scalar. In fact we must be exceedingly 
careful since an attribute like charge is defined operationally in terms of 
forces by external fields, and we must investigate the behavior of the entire 
system under both rotation and reflection before we can conclude that the 
attribute is a scalar. We shall have more to say about this very shortly. 

We go on now to the definition of another important entity, the 
pseudoscalar. The pseudoscalar differs from the scalar in only one important 
respect. The sign of the number we obtain by following our prescription 
in a left-handed coordinate system is opposite to that we obtain in a right- 
handed system. For pure rotations, scalars and pseudoscalars behave 
identically. 

To find an example of a pseudoscalar is not difficult at all. Let us take 
three points in space which in one coordinate system have the components 
(X1,¥15Z1), (%2,¥25Z2), and (x3,y3,Z3). We can construct a determinant D 
out of these nine numbers: 


Xi Vi 2% 
X2 y2 22 
x3 3 23 
X1(¥2Z3 — V3Z2) + Yi(Z2X3 — 23X2) + Z4(%2¥3 — X32) 
(1-2-14) 


(It is quite clear that D is equal to r,+(r, x r3) and has magnitude equal 
to the volume of the parallelopiped determined by r,, r,, and r,.) If we 


S 
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change coordinates by means of a pure rotation with deta = +1, we 
find that D’ as evaluated in the néw system for the same three points is 
unchanged. On the other hand, if we change from a right-handed to a left- 
handed system, D’ changes sign. (Take the simplest such transformation 
corresponding to x’ = —x, y = —y, and z’ = —z and substitute above. 
Then rotate to any other left-handed system.) 

So far we have been dealing with prescriptions whereby we come up 
with single numbers. However, we have already discovered some entities 
which require three components for their specification, like the coordinates 
of a point. This brings us then to another class of mathematical objects 
which we shall call the polar vector. 

Imagine that we have a prescription for calculating a triplet and 
obtain (v,,v2,v3) as the result of following this prescription. Consider 
next that we transform to a new coordinate system by means of the trans- 
formation a and then apply the same prescription as before. We would 
obtain a new triplet (v;,v3,v3). These triplets are the components of a 
polar vector v if and only if, for any choice of a, we have 


¢ 

Vy = QyyVy + QyQU2 + A430 

V2 = A210, + 42202 + 42303 (1-2-15) 
/ 

Vz = Q31,V, + A3202 + A3303 


Obviously the three coordinates of a point (x,y,z) constitute a polar vector. 
So do the three components of velocity and acceleration. Using the notation 
we have developed earlier, we can write 

Vv = ay 


An important characteristic of a polar vector is the fact that v changes 
sign under the pure inversion represented by 


=. soe 0 
a =| Sa (1-2-16) 
0. 01 


Under this transformation, of course, 


Uy 4 
vn = —02 (1-2-17) 
Ue Os 


On the other hand, we can imagine a triplet of numbers which behaves 
exactly like a polar vector under rotation but does not change sign under 
inversion. Such an entity comes under the classification of axial vector. 
To construct such a triplet we need only take the vector product of two 
polar vectors v and w: 
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u=Vvxw 
= [(v.W3 — 03W2), (03W, — 013), (VW. — ¥2W,)] (1-2-18) 


Under an inversion, 
v=-—-Vv w = —-w but uw = +u (1-2-19) 


We shall shortly discuss more complex mathematical entities with 
more than three components. For the moment though, let us pause and see 
if we can understand the physical importance of what we have done. 

One of our underlying physical principles is that there is no preferred 
direction or handedness! to the universe. This means that the basic laws 
of nature cannot depend on the coordinate system we choose to use for 
their formulation. Now basic physical laws are written down as equations. 
If the two sides of an equation do not have the same transformation proper- 
ties then the form of the equation will depend on the coordinate system we 
chose, thereby violating our principle. For example, suppose our equation 
sets a scalar quantity equal to a pseudoscalar quantity. If we reverse the 
handedness of our system, one side of the equation would change sign and 
the other would not, leading to an obvious change in the appearance of the 
physical law. To avoid these problems we shall agree never to write down a 
basic equation in classical physics where the two sides do not behave 
identically under transformation. That means that we will always equate 
scalars with scalars, polar vectors with polar vectors, axial vectors with 
axial vectors, and so forth. In this way, if an equation is true in one co- 
ordinate system then the identical equation will be true in any system 
related to it by rotations or inversions. 

We should emphasize that in all the above we are only talking about 
those equations which describe the fundamental physical laws. In applying 
the physical laws to any specific situation, we will usually find that there is a 
preferred coordinate system to use and hence what we have said above 
would not necessarily hold true. For example, if we were studying the tra- 
jectory followed by a baseball near the earth’s surface we would naturally 
choose one of our coordinate axes in the upward direction. On the other 
hand, when we write down a general set of laws governing the behavior 
of magnetic and electric fields (Maxwell’s equations), we will certainly 
insist that no preference be given to any coordinate system or to a particular 
handedness of our system. 

The type of reasoning we have just described plays a particularly 
important role in electromagnetism, and we would like to take the liberty 
of drawing on some illustrations here even though we have not developed 


! This belief has been shaken in recent years by the discovery of parity violation and the violation 
of time-reversal invariance in the weak interactions. Nevertheless it is still true to the best 
of our knowledge in classical electromagnetic theory. 
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the subject yet. (Hopefully the student taking this course has already studied 
some elementary physics before.) The first discovery we will make in electro- 
statics is Coulomb’s law, where we will find that the force by one charge 
on another is proportional to the product of the charges and inversely 
proportional to the distance between them. Looking ahead to Eq. (2-1-1), 
we shall write 


F,, = force by charge | on charge 2 


a. 9192(%2 — 14) 


Ir. = 7|° 


Now, at the moment we do not know whether charge is a scalar or a pseudo- 
scalar. Regardless of what it is though, the product of two charges q,q 
is a scalar. Hence the right side of the equation is a polar vector. This in 
turn means that force is a polar vector. Since force is equal to mass times 
acceleration, we next conclude that mass is a scalar (under these three- 
dimensional transformations). 

What about charge itself though? Whenever we see an elementary 
force in electromagnetism it is always proportional to the product of charges. 
Hence, there is no way of determining whether charge is a scalar or pseudo- 
scalar quantity. Since it makes no difference, we will assume it to be a scalar. 
This implies immediately that electric field E is a polar vector since it is 
just equal to force per unit charge. _ 

How about magnetic field? As we shall learn, a magnetic field B 
exerts a force on a charge q given by 


F=q—xB 


where c is the velocity of light. This tells us that B is an axial vector. 

In Chap. 3 we will discuss an experiment to search for magnetic 
monopoles. These, if they exist, are elementary magnetic charges which 
are acted upon by a magnetic field in the same way as ordinary electric 
charges are acted upon by an electric field. We would have then, for a 
magnetic charge q,, a force given by 


F, = mB 


Since B is an axial vector, we conclude that q,, is a pseudoscalar quantity. 
Obviously, if we had begun by choosing electric charge to be a pseudo- 
scalar, then we would now come out with magnetic charge as a scalar. 
We can apply the principles we have discussed above in another 
manner to help us in determining the physical laws themselves. Suppose 
we have determined that a part of our physical law states that a partic- 
ular component of a given polar (or axial) vector v is equal to the same 
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component of another polar (or axial) vector w. We can choose our x 
axis along the direction in which the components are known to be equal 
and then summarize our knowledge by the statement that v, = w, in this 
coordinate system. If we further know that there is no preferred system 
for the laws we are uncovering, then we can deduce that v, = w, and v, = w, 
also. In other words, v = w in all systems. We shall make extensive use of 
this procedure when we introduce magnetism in Chap. 3. The only difference 
will be that we will be working in the four-dimensional world of special 
relativity. It is at that point that the full beauty of the notions we have 
developed here will become apparent. 


1-3. DIFFERENTIATION OF VECTORS WITH RESPECT TO TIME 
AND POSITION; THE ‘’DEL’’ OPERATOR (V) AS A VECTOR 


Since we know how to add and subtract vectors, we know how to differentiate 
them. If r(t) = (x(4), y(0,z(4)) is a position vector of a moving particle, 
then we have 


: tim r(¢ + = - HD (1-3-1) 
But 
r(t + At) — r(t) = [x(t + Ad) — x(2), 
y(t + At) — y(d), z(t + Ad — 2(d)] 
and hence 
ar(t dx 4 


We define dr/dt to be the velocity vector v(t) of our particle. Similarly we 
can define acceleration by 


dy Oxod vy aez ' 
a= a = dr de dt (1-3-3) 


We note immediately that both v(‘) and a(#) are vectors.! This is so because 
the variable t is independent of the orientation of our coordinate system and 
can be kept constant as we carry out a rotation. If instead of ¢ we had used x, 
the triplet created would not have been a vector at all. 

Before we can treat derivatives with respect to the coordinates x, y, z, 
we must learn about partial differentiation, a trivial extension of ordinary 
differentiation. Let f be a function of the three variables x, y, z. The partial 


‘In the future we shall not differentiate between polar-and axial vectors unless it is specifically 
important to do so. We shall call them both vectors. 
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derivative of f with respect to any. of these variables is defined to be the 
ordinary derivative if we keep the other two variables fixed. Thus we have, 
for example, 


Of(x,y,Z) = lim I(x a3 dx, y, z) = f{(XV.2) 


Ox Ax=0 Ax iaal 


Suppose now we start from some point (x,y,z) and move to a new, nearby 
position (x + Ax, y + Ay, z + Az). How much does f change? The answer 
is simple if we carry out our motion in three steps. First, we move from x 
to x + dx and the function changes by an amount (df/dx) Ax. Then we 
move in the y direction by an amount Ay. The function now changes by 
an amount (df/dy) Ay. Last, we move in the z direction by an amount 
Az. The function changes by an additional amount (0f/0z) Az. (It is assumed 
in all this that Ax, Ay, and Az are sufficiently small so that the derivatives 
do not vary significantly enough over this interval to affect these approxima- 
tions.) 

Before we add these changes together let us, in the customary manner, 
replace the A’s by d’s, signifying that we are dealing with infinitesimal dis- 
placements. We have then for the total change in /: 


hy of 


Now the triplet (dx,dy,dz) is surely a vector, and the change in f as we go 
from one place to another is surely a scalar (it does not depend on the 
orientation of our coordinate system). The right side of Eq. (1-3-5) looks 
like the scalar product of the triplet (0f/0z, df/oy, 0f/dz) and the vector 
(dx,dy,dz). We expect then that the triplet (df/0x, df/dy, df/éz) is a vector, 
and we can demonstrate this explicitly. [This triplet is called the gradient 
of for Vf (“del #”’). } 

To do so we must learn how to transform from one set of variables 
(x,y,z) to another (x’,y’,z’). Let us for convenience choose the displacement 
which led to df in such a way that only x’ is changed and not y’ or z’. If we 
divide Eq. (1-3-5) by dx’ on both sides, we have 


Sf _ of Ox of oy | Of o& 
6x’ Ox Ox’ dy dx’ * Gz ax’ 


pe. Fae (1-3-5) 


(1-3-6) 


Similarly, if we allow our displacements to be dy’ and dz’, we would obtain 
the equations 


of _ of ox of oy of oz (1-3-7) 
oy’ ~ Ox dy’ éy ay’ Oz oy’ 
of _ of ox . of ey “ of oz (1-3-8) 


éz’ Ox dz’ dy dz’ az Gz’ 
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Going back to Eqs. (1-2-4), we find 


Ox oy | ” Oz aap 

Ox’ bare ay Ox’ Se boils Ox’ aS 

Ox oy oz 

ay’ = az, ay’ = Q22 ay’ = a73 (1-3-9) 
Ox oy Oz 

Bln 8G ete we? 


Substituting back into (1-3-6) to (1-3-8), we obtain appropriate transforma- 
tion equations for Vf: 


ao of af of 


=. 1 Gio tee1ao 
Ox’ 11 ox 12 ay 13 6z 


mpage i of of 
Bye acon eee ae Ay (1-3-10) 


of af af af 


a” 731 ay * SAME? ap 


Hence the gradient of f does indeed transform as a vector under rotation. 
The interesting thing, though, is that the transformation properties we are 
interested in do not concern f at all. They only concern the triplet of dif- 
ferential operators V = (0/0x,0/dy,0/6dz). Although V really means nothing 
unless something appears on its right, it nevertheless has all the character- 
istics of a polar vector under both rotation or inversion. 

In any case let us see what the gradient is like ‘“‘physically.”” We see 
from Eq. (1-3-5) that the function f changes by an amount V/- dr if we 
undergo a displacement dr. Thus Vf points along the direction in which f 
changes most rapidly. Its magnitude is the rate of change of f with respect 
to distance along that direction. 

We shall have much more to do with our “del” operator as we proceed 
to develop a number of other mathematical and physical concepts. 


1-4 THE NOTION OF FLUX; DIVERGENCE 
OF A VECTOR FIELD; GAUSS’ THEOREM 


As we examine the world around us we find numerous instances where we 
need to speak of vector fields, that is, vector functions of position defined 
for all points within a given volume. The velocity or momentum density of 
fluids, electric and magnetic fields, and gravitational forces are examples 
that we come across quite readily. In each of these cases we will often make 
use of a very simple notion—the flux of the vector field v(x,y,z) through an 
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imaginary infinitesimal bit of area dA with normal vector n. We first define 
it mathematically. 


Flux through @A in direction of f = v - idA 


The reason for calling this quantity flux is reasonably clear. As we see from 
Fig. 1-3, if v is the velocity of a fluid of uniform density, then the flux is 
just the volume of fluid that passes through the surface AA in 1 sec. 

Needless to say, we can now proceed to integrate the flux over a finite 
area A. We have then 


Flux of v through A in direction of fi = | v-fidd (1-4-1) 
A 

We now prove a simple but powerful theorem which relates the net outward- 
going flux of v through a closed surface to the space derivatives of v within 
the surface. This theorem will be of particular value to us in our study of 
electrostatics. We begin by taking the infinitesimal volume bounded by 
the range of coordinates x to x + dx, yto y + dy, ztoz + dz, and shown 
in Fig. 1-4. 

Let wall 1 be the wall parallel to the xy plane through the point (x,y,z). 

Let wall 2 be the wall parallel to the xy plane through the point 
(x + dx, y + dy, z + dz). 

Let wall 3 be the wall parallel to the yz plane through the point (x,y,z). 


Fig.1-3 The flux through dA of the vector field v is just v- A dA. If v is the 
velocity of a fluid of uniform density, then v- fi dA is just the volume of fluid 
passing through AA in J sec. 


Volume passing through AA in 1 sec 
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Wall 2 


(x + dx, y + dy,z + dz) 


Fig. 1-4 We calculate the outward-going 
flux of the vector field v from the infinites- 
x imal volume shown here. 


Let wall 4 be the wall parallel to the yz plane through the point 
(x + dx, y + dy, z + dz). 
Let wall 5 be the wall parallel to the xz plane through the point (x,y,z). 
Let wall 6 be the wall parallel to the xz plane through the point 
(x + dx, y + dy,z + dz). 
_ The area of wall | is dx dy. The outward-going normal to wall | is 
—k. 
The area of wall 2 is dx dy. The outward-going normal to wall 2 is 


The area of wall 3 is dy dz. The outward-going normal to wall 3 is 
The area of wall 4 is dy dz. The outward-going normal to wall 4 is 
The area of wall 5 is dx dz. The outward-going normal to wall 5 is 
The area of wall 6 is dx dz. The outward-going normal to wall 6 is 
The outward-going flux through wall 1 is —v(1) - k dx dy. 


The outward-going flux through wall 2 is v(2) - k dx dy. 
The net outward-going flux through walls 1 and 2 is then 


(Net flux), 4. = [v(2) — v(1)] - k dx dy 
= [v,(2) — v,(1)] dx dy 


= (3) dz dx dy 
0z 


Similarly 


Ov, 


t fi = | —— 
(Neminx), 5 (Zs) a dy ae 
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and 


(Net flux)... = (3) dy dx dz 


Noting that the volume of dV is dx dy dz, we have 


Ov Ov Ov 
N = || —_ Sere PAe 
et flux out (F a. By + x) av (1-4-2) 


We note now that the term (0v,/0x + dv,/dy + dv,/6z) is just V - v; that is, 
it is the scalar product of the “del” operator and the vector function v. 
Just like any scalar product of two vectors it must be invariant under rota- 
tion. This scalar product V - v is generally called the divergence of v. 

As we see from Eq. (1-4-2) the divergence of a vector function of space 
is a measure of the extent to which there are local sources (or sinks) present, 
that is, the extent to which there is a net flux out of (or into) a region. 

We can extend this very simply to a finite volume V enclosed by a 
surface S. The volume can be broken into infinitesimal bits dV. Let us 
sum together the flux leaving all bits. The flux leaving one volume bit will 
either pass through S or enter another volume bit. Hence we have the 
result 


- Sum of all flux leaving all volume bits = flux leaving through S 


In mathematical terms we can write 


| (V-vdV= | (vf) dA (1-4-3) 
volume V surface S 


bounding V 


This result, known as Gauss’ theorem, will turn out to be tremendously 
useful as we proceed with our development of electromagnetic theory. 

Returning to Eq. (1-4-2), let us make use of it to derive a differential 
equation for the flow of a fluid. In this case, v is the velocity of the fluid 
at a given point. Let p be the density of fluid. The flux of mass out of the 
infinitesimal volume dV is just V - pv dV. The amount of mass in the volume 
is just p dV. Conservation of mass then tells us that 


dp 
: see) 1-4-4 
V-pv+ - ( ) 
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This equation will also be encountered later when we discuss charge con- 
servation and will play a rather important role in our relativistic develop- 
ment of electromagnetism. 


1-5 THE CURL OF A VECTOR FUNCTION 
OF SPACE; STOKES’ THEOREM 


We have come across the notion of a line integral in the past, when we 
covered work and energy in our elementary mechanics course. At that time 
we learned that the bit of work dW done by a force F on a given point object 
undergoing a displacement dl is just F - dl. If we wish to determine the 
work done in going from point A to point B along a given path, then we 
divide our path into infinitesimal bits and add together the contributions 
from each of the bits. That is, 


B 
w=| F- dl 
A 


Sometimes the path from A to B would be irrelevant; for any A or B the 
work done would be independent of the path taken between them. In that 
case we called the force conservative and observed that the line integral 
of F - dl around any closed loop would then be zero; that is, $F -dl=0 
for a conservative force. 

We now broaden our horizons a bit and consider the integral around 
a closed loop of the function v - dl where v(x,y,z) is a vector function of 
space and dlis an element of the loop. We propose to prove a very important 
theorem about this integral, relating it mathematically to a surface integral 
over any surface bounded by our closed loop. To begin with, though, we 
examine a simple rectangular path in the xy plane with sides that are of 
length dx and dy, respectively. We choose the direction of the path as 
shown in Fig. 1-5 and number the legs of the rectangle as indicated. Along 
leg (1), 

v- dl = [v(1) + i] ax 

= v,(1) dx 

Along leg (2), 

v-dl = v,(2) dy 
Along leg (3), 

v-dl = —»,(3) dx 
Along leg (4), 

v-dl = —v,(4) dy 
Adding these together, we have 


} vat ~ (2.1) — hie ee) — oes 
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x,y + dy ©) x + dx, y + dy 


® @ 


Fig. 1-5 We carry out the 
“line integral” of v- dl around 
an infinitesimal loop in the xy 


plane. 
but 
Ov 
v,(3) — v,(1) = = 
( ay ly 
ov 
v,(2) — v,(4) = 3x 
Hence 
0 
; v-dl= (= i =| dy dx (1-5-1) 
ee Ox oy 


Now the left side of the equation is an invariant under rotation. (Is it a 
scalar or a pseudoscalar?) Hence we must rewrite the right side so that it 
also is an invariant under rotation. 


Let ft be a unit vector normal to dA in the direction given by a right-hand 
rule. That is, ff points in the direction of the thumb if the fingers are along 
the direction of the path of integration. Then 
} v-dl =(V x v)- fidA (1-5-2) 
infinitesimal 


loop 
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The right-hand side is now in the form where it clearly exhibits the same 
invariance properties as the left-hand side. Hence we can now rotate our 
coordinate system arbitrarily, and the equation will still be true. It is thus 
true in general regardless of the orientation of the bit of area relative to the 
Xx, ¥, 2 aes 

We now examine the situation of a finite loop C shown in Fig. 1-6. 
We begin by choosing a direction in which we are to go around the loop. 
We then cover the loop with a surface S and establish a normal to each point 
on S in accordance with the same right-hand rule we used earlier. We now 
break up S into infinitesimal loops as shown. Clearly 


; v-dl=) ¢ v-dl 
Cc i Jinfinitesimal 
loop C; 


The sum at the right is converted to an integral, and we have Stokes’ theorem 


$v - dl -| (V x v)-fi'dA (1-5-3) 
Cc surface bounded 
by C 


We observe then that V x v is a measure of the extent to which v “curls” 
about, giving a nonzero line integral around a loop. If F is a conservative 


Fig.1-6 An arbitrary loop C is broken into a set of infinitesimal rec- 
tangular loops C;. 


a Infinitesimal 
rectangular 
loop C; 
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force, then we must have 
; F-dl=0 for any closed path 


Hence | 
Vx F=0 for a conservative force 
It is important to note here that the choice of surface was arbitrary. Let 


S, and S, be two surfaces, each bounded by C. Since either surface is 
satisfactory, we can write 


PV xv add = $v xv. ad 
Si s 


2 


If we let V be the volume enclosed by the two surfaces, we conclude from 
Gauss’ theorem that 


[0 -¥ x war=o 
V 


Hence we may expect that V - V x v = 0 for any vector function v. The 
reader may verify this result explicitly. 


1-6 TENSORS OF THE SECOND RANK 


We now go a bit beyond the notion of a vector to introduce a new type 
of mathematical object, a tensor of the second rank. We have seen that a 
vector is best defined in terms of its transformation properties under a 
rotation. The same type of definition will be given here. However, before 
that, let us play about a bit with a simple example. 
Consider the set of nine numbers that are obtained by multiplying 
the three components of the vector (v,,v2,v3) by three components of the 
vector (w,,W2,W3). We can list these nine numbers conveniently in the form 
of a matrix which we call T. 


DiWy DyW2 VyW3 
T = DW, DW vzW; 
V3W, V3W2 U3W3 


If we had used a different coordinate system, our two vectors would, of 
course, have had components (vj,v03,03) and (w1,w2,w3) and we would 
have come up with a listing 


iW, ViW, v{W3 
T’ =| vjw, 02Wz 03W3 


V3W, 03W, 03W3 


22 MATHEMATICAL REVIEW 


Now, we might ask, how is an element of T’ related to an element of T? 
The answer is trivial since we already know precisely how to transform the 
v’s and w’s into v’’s and w’’s [see Eq. (1-2-4)]. 


, 


3 
UES 3 Qi Vy 
k=1 


/ 


3 
Ud lee Sy QimWm 
m=1 


Hence we can obtain the element v;w; by simple multiplication of the two 
relevant sums. 


3° 3 
vw) = Do DY Gin@jn) (Oem) (1-6-1) 


k=1 m=1 


Alternatively, we can go the other way by the transformation 


38 

UW; = M3 »y (24:4 nj) (Vn) (1-6-2) 

m=1 k=1 
We have then a clear-cut way in which the nine elements in the matrix 
T’ can be written in terms of the nine elements in the matrix T and vice 
versa. Each element in one is a sum of coefficients times the elements of 
the other. The coefficients themselves only depend upon the rotation matrix 
a between the two coordinate systems. 

We now define a tensor of the second rank as an array of nine objects 
which transforms in the same manner as the elements v;w, under rotation. 
Thus we have, for any tensor T, 

Ta = » Ay; T; 


ij 
ij 


(1-6-3) 


Incidentally, having learned to multiply matrices [see Eq. (1-2-7)], we can 
rewrite Eq. (1-6-3) in a simple form. 


T =aTa™! : (1-6-4) 
where a“! is the inverse rotation matrix. We remember that 
a; * = aj (1-6-5) 


To get a better feeling for what a tensor actually is in physical terms, 
let us prove a simple mathematical theorem. We will show that the product 
of a second-rank tensor T and a vector v transforms as a vector. To do this, 
let us first write down the product Tv as it appears in both the primed and 
the unprimed coordinate systems. We will call the components of these 
products w; and wi, respectively. 

w, = > Tr (1-6-6) 


t 
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Wi = » Ti,0; ' (1-6-7) 
What we would like to show, of course, is that the three numbers w; in Eq. 
(1-6-6) transform into the three numbers in Eq. (1-6-7) precisely as would 
the three components of a vector. 

To prove this theorem, let us just make use of the fundamental trans- 
formation properties of T and v. 


-_ / 
Tj; = » Ay jy; Th 
v; = 2 AmiVy m 


We substitute these expressions back into Eq. (1-6-6), obtaining thereby 
Wp ve Aj; Ani Th Vin (1-6-9) 


i,k,l,m 


(1-6-8) 


To evaluate this expression, we sum over i first. We recall Eq. (1-2-3), 
which can be written in simplified notation as 


a AAmi = Sim (1-6-10) 


where 6,,,, called the Kronecker 6, is zero when m is unequal to / and 1 
when m is equal to /. Substituting Eq. (1-6-10) back into Eq. (1-6-9), we 
obtain 

w=) Sm xj Ta Vin 


J 
k,l,m 


We next sum over m and observe that only the terms with m = / remain. 
W; = Yj Tai 


Finally, summing over / and making use of Eq. (1-6-7), we have 
Wi = DW (1-6-11) 
k 


This is exactly the transformation property we require of a vector. Hence 
we have proven our theorem. 

We now have some insight into what a tensor really is. It is a linear 
relationship between two vectors of the form given by Eq. (1-6-6). Whenever 
we encounter a situation in physics where two vectors depend linearly on 
one another but point in different directions, the relationship between them 
will be tensorial in nature. For example, the stress and strain in material 
are linearly related when the material behaves elastically, yet they do not 
necessarily point in the same direction. The relationship between them is 
called the stress tensor. 

To add two tensors we just add the individual components. This 
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permits us to make a very simple decomposition of any tensor T into 
symmetric and antisymmetric parts. Let 


T=. AsatlS (1-6-12) 
where 

Ape me = -A; (1-6-13) 

c= ae es, (1-6-14) 


The tensor A has only three components, and the tensor § has six compo- 
nents. This decomposition would be pointless were it not for the fact that 
A remains antisymmetric and S remains symmetric as we rotate. 


Ai = Yanan Ag = — 244A = — Aji (1-6-15) 
Kl kl 

Sig = nay Sa = + DY GAS = +S (1-6-16) 
kl kl 


Let us take a look at the three components of A,; in terms of their trans- 
formation properties under rotation. 


A, transforms as (v,;w2 — 02W;) 
A,3 transforms as (v,;w3 — 03W4) 
A>; transforms as (v,W3 — V3W2) 


Thus the triplet (A,,,—A,3,4,2) transforms like the vector product of 
two vectors. A vector product is simply an antisymmetric tensor of the 
second rank. 


1-7. DIAGONALIZING A SECOND-RANK SYMMETRIC TENSOR 


An interesting and es property of a symmetric tensor S is the fact 
that it can be “‘diagonalized.”’ This means that we can rotate toa coordinate 
system given by a triplet of orthogonal unit vectors i’, j’, and k’ within 
which S’ has only three diagonal elements, namely, those Sj; for which 
i = j. All off-diagonal elements are equal to zero in this coordinate system. 
We will show explicitly that this 1 is so by finding the vectors 1’, j’, and k’ in 
terms of the initial base vectors i, j, and k and then rotating to the new 
system. 

Before doing this, though, let us assume the result to be true and write 
out the tensor S;, as we expect it to appear: 


A; ONO 
SH 4,0 (1-7-1) 
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We notice immediately that S’/ = 1,1’, SJ’ = A,j’, and S’k’ = A,k’. This 
gives us a clue as to how to proceed, starting out with the tensor S and its 
six components. We will search for unit vectors which satisfy the equation 


Sa = Af (1-922) 


Hopefully we will find three such vectors which will be orthogonal and which 
we can then identify as 1’, j’, and k’. 
Let us write out Eq. (1-7-2) explicitly: 


Sir Siz S43 ny n, 
Si2 So S53 ny = A ny (1-7-3) 
Si3 S23 S33 n, n, 


Expanding the above equation, we find three equations for n,, n,, and n,: 
(Si, — An, + S,2n, + Sy3n, = 0 
Sion, + (S22 — Any + Sz3n, = 0 (1-7-4) 
S13, + Sp3n, + (S33 — An, = 0 


In order that we have a nonzero solution for n,, n,, and n,, we must set the 
determinant of these equations equal to zero. That is, we must have 


Sir- 4 Sip Si3 
S12 S22 —A S23 =0 Ci-7P5) 
S13 Sp3 S33 — 4 


Clearly this yields a cubic equation for A with three solutions which we can 
call A,, 22, and A;. We can now throw out one of Eqs. (1-7-4) because they 
are no longer all independent and use the other two to determine n,/n, and 
n,/n, for each choice of 4. Normalizing so that n,? + n,? + n,? = 1 com- 
pletes our job of determining fi,, f,, and fi3. 

Now, before we can identify fi,, f,, and fi, with i’, j’, and k’, we must 
show that they are orthogonal. Let us assume first that /,, 2,, and A, are all 
different. Then we have from Eq. (1-7-2) 


SA, I Ay ft, 
Sh, = Azh, (1-7-6) 
SA, = Ash, 


We take the scalar product of the first equation with ft, and of the second 
equation with fi, and then subtract the second equation from the first. 
This leads to 


fi, . Sh, am fi, ° Sh, = (A, = A,)h, *fl, (1-7-7) 
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Because of the symmetry of S we see immediately that A, «SA, = fi, -SfA,. 
Hence we conclude that 


(A, — Ap)ft, fy = 0 (1-7-8) 


Since 4, + A,, this equation implies that f, and fi, are orthogonal. The 
same proof now demonstrates that fi, is also orthogonal to both f, and 
fi,. Hence if A,, ,, and A, are all different, we can identify them with i’, 
j, and k’ and arrange the correspondence so as to make up a right-handed 
coordinate system. 

The three constants 4,, 4,, and A, are called the eigenvalues of Eq. 
(1-7-2) and the corresponding solutions are the eigenvectors fi,, fi,, and fs. 
We might ask what happens, for example, ifA, = A. It is easy to demonstrate 
then (see Prob. 1-10) that any vector normal to f, is an eigenvector [solution 
to Eq. (1-7-2)] with eigenvalue 1,. Thus we can always choose two orthogonal 
ones which we can call ft, and ft,, respectively, thereby completing our 
coordinate system. 

We shall come back to the subject of tensors when we find them useful 
in electromagnetism. Suffice it to say at this point that the electric and 
magnetic fields which we have always thought of as vectors will turn out 
to be parts of a four-dimensional tensor of the second rank. 


PROBLEMS 


1-1. Demonstrate the following vector identities. 
(a) A x (B x C) = (A- OB — (A- B)C 
(b) A-(B x C) = (A x B)-C 
(c) Vuv = uVv + vVu 
(dq) V-uA=uV-A+A-Vu 
(ec) V x uvA=uV x A+ VuxA 
()V-(Ax B)=B-VxA-A:VxB 
(g) Vx (V x A)=V(V- A) —V7A 
aA ‘ 07A . aA 
ax dy? 62? 
(h) V(A-B) = A x (V x B) + (A: V)B+ B x (V x A) + (B-V)A 
(i) V x (A x B) = (B- V)A — (A- V)B + A(V - B) — B(V- A) 


1-2. Show explicitly that 
V-(Vxyv=0 for any vector function v 
Vea Vj — 0 for any function f 


where V7A = 


1-3. Prove that if C is a closed curve, S is the surface bounded by C, and g is any 
function of space, then 


\ 


[ea=|o x Vo) dA 
(2 AY 
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1-4. 


1-5. 


1-6. 


1-7. 


1-9. 


Show that the magnitude of a-(b-x c) is equal to the volume of the parallelo- 
piped determined by a, b, and c. 


Let r be a vector from the origin to the point (x,y,z), and let r’ be a vector from 
the origin to the point (x’,y’,z’). Evaluate the following expressions in terms of 
randr’. 


——= 
|r —r’ 
1 
i 
OM 
1 
(c) er forr = r’ 
(d) V x ger forr # ¥’ 


Prove the following vector identity for a volume V enclosed by a surface S. 


[vx var= | ax vad 
V § 


where y is any vector function of space. (Hint: Examine the components of this 
equation and use Gauss’ theorem.) 


The trace of a second-rank tensor is the sum of all of its diagonal elements. 
That is, 


3 
1 


k= 
Show that Tr T is invariant under rotation. 


Show that the sum of squares of all the elements of a second-rank tensor is 
invariant under rotation. 


If S and T are two second-rank tensors, show that >» S;;7;; is an invariant under 
rotation. a 

In diagonalizing a symmetric tensor S, we find that two of the eigenvalues 
(A, and A.) are equal but the third (A3) is different. Show that any vector which 
is normal to fl; is then an eigenvector of & with eigenvalue equal to 4,. 


2 ) 
Principles of Electrostatics 


2-1 INTRODUCTION; COULOMB’S LAW 


Our study of electromagnetism begins very simply with Coulomb’s law. 
In the chapters ahead as we examine the beautiful structure of Maxwell’s 
equations, we must try to remember that it all began as a law of force govern- 
ing the interaction of two charged particles. It will be amazing to see how 
much ground we can cover on this one tank of fuel. 

As we all know, there is an attribute which exists in matter and which 
we have come to call electric charge. Some objects may have positive charge 
and some negative. Once a standard charge sign has been chosen, all other 
charges can be classified in sign by whether they are attracted to or repelled 
by the standard charge. Having done this, we observe that all pairs of 
charges with like sign repel one another and all pairs with unlike sign 
attract one another. This pseudosexual rule can be codified quantitatively 
28 
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by observing that the force of one charge on another is proportional in 
magnitude to the product of the charges and inversely proportional to the 
square of their separation. We then write down Coulomb’s law. 


F,,. = force by charge 1 on charge 2 


= 119212 — 14) 
lr. — r,|° 


(31) 


Of course we must set up a system of units, and we do so in the most natural 
way possible. We let distance be measured in centimeters and force in 
dynes. Equation (2-1-1) then serves to define the unit of charge, which we 
call the esu (electrostatic unit). Thus two equal charges, each of one electro- 
static unit, will pull (or push) on each other with a force of one dyne if 
they are set one centimeter apart. 

If we have a number of charges around, then the force on each by all 
the others can be obtained by simply adding the individual forces vectorially. 
If 91, 2, ---» G, are a set of charges at positions r,, 12, ... , F,, respectively, 
then the force on q; is 

R= DR 5 ae Q-1-2) 

iFi ie [hl 

We now introduce the notion of electric field. Having set our charges 
91> 925+ ++», into their positions r,;,...,¥,, we can place a very small 
charge at position r = (x,y,z). We then measure the force per unit charge on 
q in the limit as g approaches zero. (The limiting procedure is carried out 
to avoid disturbing the other charges.) The result is the electric field E at 


(x,y,Z). 


force on q at (x,y,z) 


E(x, y,z) = lim (2-1-3) 


q>0 q 
We have considered Coulomb’s law and defined E for a system of 
point charges. There are no point charges in nature; we shall have to deal 
with charge distributions over finite volumes. Hence we introduce the 
notion of charge density. We define the function p(x,y,z) to be the charge 
per unit volume at any point (x,y,z). Changing the sum in Eq. (2-1-2) to an 
integral and making use of our definition of electric field, we have then 


rr-r 
E(x,y,z) = E(r) = | BO ay (2-1-4) 
all |r au | 
space 
To develop a feeling for what we are doing in our integral, it is convenient 
to refer to Fig. 2-1. We consider a charged object located in space and a 
point given by the vector r at which we wish to evaluate the electric field. 
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Fig. 2-1 We obtain the field at 
r by summing the contributions 
of each element dV’. 


We break our volume up into little pieces dV’ and let the vector r’ go from 
the origin to dV’. The charge of dV’ is just p(r’)dV. The field at r due to 
this bit of charge is just 


ere 


Equation (2-1-4) is obtained by adding all the contributions together. 
As we perform the integration, r is, of course, kept fixed. 


2-2. THE DIVERGENCE OF E; GAUSS’ LAW 


The inverse square law governing the rate at which electric field drops off 
as a function of distance from its source has an immediate and important 
consequence. In a charge-free region the divergence of E is zero, and hence 
within such a region the flux of E into (or out of) any enclosed volume is 
zero. As we shall see, the flux of E out of any enclosed volume is just pro- 
portional to the total charge contained within the volume. This result, 
which is called Gauss’ law, provides an exceedingly powerful tool for the 
calculation of electric field in a case of high symmetry. We shall make 
considerable use of it. 

We wish then to evaluate V - E explicitly. Nothing could be simpler. 
Using Eq. (2-1-4), we obtain 


VEG) = v-[ Lessacals la 
all |r an | 
space 


We can next interchange the order of differentiation and integration. To 
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convince himself that this is a legitimate procedure, the student should 
carry out the same procedure with differences and sums. The difference 
between two sums of a set of terms is equal to the sum of the differences. 
We have then 


V-E(n) = | pe ay (2-2-1) 


Since p is not a function of the variables x, y, z with respect to which 
we are differentiating, we can rewrite this as 


Slee (2-2-2) 


r—r ] } 1 : 
rie a 


Evaluating the various terms in the above expression, we have 


1 _ Ser) 
gare Fo 
Vr — ry — Vr =3 
and hence 
r—r —3 3 
: pa ee ND 
See al 


We thus conclude that V - E is zero in any charge-free region, as long as r’ 
can never equal r. We need only consider then the integral over an infini- 
tesimal sphere about the point r = r’: 


r—r 


V-E(r) =| pV ay” 


sphere 
about r 


We can remove p(r’) from the integral since it does not vary much over the 
immediate neighborhood of r. We evaluate it at r. ’ 


r-r 


ey ae 


V-E(@) = oto | 


small sphere 
about F 


Now differentiating (r — r’)/|r — 1’|* with respect to x, y, or z is equivalent 
to differentiating with respect to — x’, —y’, and —z’. That is to say, 
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where 


, 0. 70" 0 
vray we) 
This gives us the result 


V-E(r) = —p(r) a 
sphere |r - 
about r 


By Gauss’ theorem [Eq. (1-4-3)] we convert this to a surface integral. 


r-r r-r 
V-E() = +p(r) oo rip et ed Yl 
7{3 - 
surface |r mm r’| lr a r’| 
of sphere 
about r 
dA’ 
= p(r) A) (4 
surface |r 16 
of sphere 
about r 


The integral on the right is just equal to 4x. Thus we have 
V-E() = 4zp(r) (2-2-3) 


This result is one of the four basic equations of Maxwell and lies at 
the foundation of electrostatics. It says, in effect, that electric field can 
only have a net flux into or out of a region if there is charge within the 
region. We use this to derive Gauss’ law by integrating over a volume V. 


| V-E(r) dV = 42 | p(r) dV (2-2-4) 
V V 


The left side of Eq. (2-2-4) can be transformed into a surface integral 
by means of Gauss’ theorem. 


[var | ena 
v s 


The right side of Eq. (2-2-4) is just 4x times the total charge Q within the 
volume. We have then 


| E-fdA = 40 (2-2-5) 
RY 


In other words, the total flux of electric field out of any given volume 
is just equal to 4x times the charge within the volume. 

Gauss’ law greatly facilitates the determination of electric field in 
situations characterized by high symmetry. We illustrate its application by 
calculating the field everywhere due to a spherically symmetrical, uniform 
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charge distribution of radius R and total charge Q. For convenience we 
set the origin of our coordinate system at the center of the sphere, as shown 
in Fig. 2-2, and let r be the distance from the origin to the point at which 
we wish to determine the field. Obviously, by symmetry, the field must be 
radial in direction with magnitude only dependent upon r. We set an 
imaginary spherical surface at radius r and observe that the total flux of 
E out of the volume enclosed by this surface is just 4ar7E(r). If r< R 
then the charge within the surface is just Qr?/R°. We have then 


E(r < R) = g 
(2-2-6) 
Er 2 R)= < 


2-3 A FEW WORDS ABOUT MATERIALS; CONDUCTORS 


At this point we must say a few words about the electrical nature of materials. 
Later, in Chap. 10, we will try to give a more detailed and extensive picture 
of what actually goes on at the microscopic level. For now we will be rather 
brief and somewhat incomplete. 

As we ali know, matter is constituted of positively charged heavy 
nuclei surrounded by negatively charged light electrons. The scale of 
physical dimensions of macroscopic bodies is determined by the natural 
radius of an electron cloud about the nucleus, about 10°*® cm. Hence 
about 1074 electrons are typically packed into a cubic centimeter of solid 
material and provide the bonds which keep it together. The outermost 


ax | 


Fig. 2-2. We use Gauss’ law to 
determine the electric field every- 
where due to a uniform spheri- 


cal charge distribution of radius 
Zz R. 
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electrons in each atom are relatively easy to move about and can act as 
charge carriers in conductors. 

Now, to get some physical feeling for the strength of electrostatics, 
we should point out that if every electron were removed from each of two 
1-gram pith balls and they were placed 1 cm apart, the force of repulsion 
between them would be about 107? dynes or about 107° tons! A more 
relevant illustration (suggested to the author by Prof. D. A. Bromley) is 
the fact that if all the electrons were removed from one-tenth of a cubic 
millimeter of the material on the nose cone of an Apollo rocket in some 
unspecified manner and brought down to the pad, the attraction between 
these electrons and the remaining positive charges on the nose cone would 
be sufficient to anchor the rocket firmly in place despite full thrust from the 
first stage. Remarkable indeed! We see thus that the fraction of available 
electrons which are involved when we charge or discharge an object is 
infinitesimal. 

Some materials have the property that all their electron clouds are 
quite strongly bound to the individual positive nuclei. In this case the applica- 
tion of a reasonable local electric field within the material serves only to 
stretch the bonds between electrons and individual nuclei and not to break 
them. As we shall learn shortly, the material is then polarized but no real 
current flows. Such materials are called dielectrics. 

Within some materials, called conductors, the application of an electric 
field causes the outermost electrons about each nucleus to move relatively 
freely from atom to atom and to continue moving as long as any field 
remains within the conductor. If our conductor is isolated, then the moving 
charge will pile up somewhere until it just neutralizes the applied field 
within the conductor. At that point no further charge will flow. (There is 
a tacit assumption here that the momentum picked up by the electrons 
as they accelerate can be ignored. This assumption is valid because each 
electron can only accelerate for a very short time before colliding with an 
atom.) 

The fact that the electric field within a conductor is zero in static 
equilibrium permits us to deduce that the charge density is zero within the 
conductor [see Eq. (2-2-3)]. Hence whatever charge has piled up must be 
on the surface of the conductor. 

Let us apply some of what we have just learned to the solution of a 
simple problem. We have two concentric conducting spheres, as shown 
in Fig. 2-3. The inner sphere has inner radius a and outer radius b. The 
outer sphere has inner radius c and outer radius d. We place a charge Q, 
on the inner sphere and a charge Q, on the outer sphere. We would like to 
know how the charge is distributed and what the value of electric field is 
at every point in space. We let a,, o,, o,, and o, be the unknown surface 
charge densities per unit area on the surfaces with radii a, b, c, and d, 
respectively. 
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Fig. 2-3 Two concentric conducting spheres 
have charges Q, and Q, placed on them. We 
wish to evaluate the surface charge densities and 
electric fields. 


Again symmetry tells us that the electric field must be in a radial 
direction and its magnitude can only depend on the radius from the center, 
r. The imaginary gaussian surfaces through which we evaluate the flux of 
E are, of course, spheres at various radii. We see immediately that E = 0 
for0 < r S asince there is no charge within the inner sphere. Fora < r < 3, 
we are within the conducting material, and so E = 0. Hence there can be 
no charge density on the inner surface of the inner conductor (o, = 0). 
All of Q, must appear on surface b, and we conclude that 


ne 
>» Arb? 
Between the two spheres we have 
4nQ Q 
Ebsrsd= Alas? ai 


Inasmuch as there is no field within the second conductor (c <r < d) 
the total charge on the inner surface of the outer conductor must be — Q,. 
Hence 


sais 


°  Anc? 


Finally the remaining charge on the outer conductor appears on its outer 
surface, leading to the result that 


92+ 91 


oo 4nd? 
The field outside the second conductor is thus 


Q,+Q, 
Be 


oO 


E(r 2 d) = 
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2-4 THE CONSERVATIVE NATURE OF ELECTROSTATICS; POTENTIAL 


We will now demonstrate that the electrostatic force is conservative. That 
is to say, the work done by it as we take a charged object from one point to 
another is independent of the path taken between the two points. (Another 
example of a conservative force is gravitation.) Alternatively, the work done 
by this force as we go around a closed loop is zero. The proof is very simple 
and rests upon an application of Stokes’ theorem, viz., 


$e-a= |v x E-fidA 


Thus we need only show that V x E = 0. Using Eq. (2-1-4), we have 


r-r 
V x E = [, ptr’) V x je cp 4 (2-4-1) 
space 
But 
ar. 1 . Vx(-r) 
Vx Fore Tein x(r—r)+ re 


ll 


NowV x (@®-—-r)=Vxr 
asr — r’, and hence 


0. Also, V(1/|r — r’|*) is the same direction 


V (r-—r)=0 


——_; x 
F-¥P 
We conclude that V x E = 0, at least as long as r is not equal to r’ at any 
point in our integral. To cope with the contribution from the neighborhood 
where r = r’, we again convert our volume integral into a surface integral. 
r _— 
vxe=[ Pee 
small |r a | 
sphere 
about r 


We can take p(r’) out of the integral and evaluate it at r, leaving us with the 
equation 


V x E=p(r) eT | 
sphere ; lr —Fr | 
about r 


Now, for any vector function v, we have the general identity 
| vx vay = | fix vdA (2-4-2) 
vol surface 


(This identity is easily proven by examining its components and using 
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Gauss’ theorem.) Changing V to —V’ and using (2-4-2), we convert our 
expression for V x E to a surface integral. 


Vx E= —p(r) i x ae (2-4-3) 
surface |r cra | 

of sphere 

about r 


The right side of (2-4-3) is zero since the normal ff’ is just parallel tor — r’. 
Hence, for the case of electrostatics, 


Vx E=0 (2-4-4) 

We conclude that E is conservative. The integral | E - dl between two 
points is independent of path. 

We can now choose an arbitrary reference point and evaluate | E - dl 
from any point r to this reference point. It is conventional to take the refer- 


ence point at infinity in the case of electrostatics. We define the function 
g(r), called the potential function, by the equation 


g(r) = | E- dl (2-4-5) 


If we evaluate the potential function g at two points A and B, then the 
work done by E per unit charge moved from A to B (if all other charges 
are kept fixed) is just g(A) — g(B). Letting point A be given by r and 
point B byr + Ar, we have 

g(A) — 9(B) = ot) — g(r + Ar) = E- Ar 
But 

g(r + Ar) — g(r) = Vo: Ar 
Since Ar is arbitrary, we conclude that 

Vo = -E (2-4-6) 


To summarize, the conservative nature of electrostatics permits us to 
set up a potential function g(r) defined for every point in space such that 
the gradient of ¢ is equal to the negative of the electric field. The power of 
this observation will become apparent when we discover that it is often 
easier to determine ¢ than it is to determine E directly. 

Let us evaluate the potential function at a distance r from a point 
charge q. 


o(r) = | ° Sr = = (2-4-7) 


If we have a distribution of charge p(r’), we sum the contributions to the 
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potential and obtain 


{ pr ) dV’ (2-4-8) 
ars |r — &'| 
space 

In the case of a conductor the absence of any internal electric field 
ensures that it is an equipotential. We might return then to the problem of 
the two concentric spheres with charges Q, and Q,, respectively (see Fig. 
2-3), and evaluate the potentials on these spheres. The potential on the inner 


sphere is obtained by integrating from radius 5 outward. 


g(b) = { E(r) dr + | . E(r) dr 
b a 


‘ 1 OL 2 
= I I Q,+ Q2 
a5 ae d 


The potential on the outer sphere is just g(d) = (Q, + Q,)/d. 

We now make a very simple application of what we have just learned 
to demonstrate that the electric field at the surface of a conductor is per- 
pendicular to the surface. Figure 2-4 shows a portion of that surface and 
the path for which we wish to evaluate $E - dl. The legs of the path which 
cross the surface are assumed to be of infinitesimal and negligible length. 


Path of integration for 
evaluating §E° dl 


Fig. 2-4 We evaluate ¢ E : dl for the path shown at 
the surface of a conductor. In order that the integral 
be equal to zero we must have no tangential electric 
field just outside the conducting surface. 
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Since there is no field inside the conductor, we conclude that the contribu- 
tion to the leg outside the conductor must be zero. Hence there is no tangen- 
tial component of electric field at the conducting surface. 

We can use Gauss’ law to obtain the magnitude of E at the surface 
of a conductor with surface charge density = o. We enclose the surface, 
as shown in Fig. 2-Sa, within a flat thin box of negligible thickness and area 
AA and evaluate the flux out of the box. 


Flux = (£)(AA) = 4ncAA 
Hence the field at the surface of a conductor is 


E = 4x0 (2-4-9) 


E = 2no 


Fig. 2-5 (a) We evaluate the field at a con- 
ducting surface in terms of o, the charge density 
per unit area. (b) A little section of the surface 
charge considered all by itself would give rise 
(b) to a field of 220 on either side of it. 
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If we now have a good look at our little area AA, we see that the 
surface charge on it (c), if considered all by itself, would give rise to a field 
of 2x0 pointing in on one side of it and 2x0 pointing out on the other side 
(see Fig. 2-55). That means that all the rest of the conductor must give rise 
to a field 2xo pointing out, just at the surface. On the inside of the surface 
this field from the rest of the conductor cancels the ingoing 220 from the 
little segment AA. On the outside of the surface the field from the rest of the 
conductor augments the field from the segment AA to give us the result of 
Eq. (2-4-9). 

Knowing how much of the field is contributed by the local surface 
charge o and how much is contributed by the rest of the conductor permits 
us to calculate the force on the little bit of surface area AA. We just multiply 
the local charge (cAA) by the field due to the remaining charges (270) 
and find 


F = 2no?AA (2-4-10) 


In other words, the surface of a conductor feels an outward-going pressure 
given by 
P= 2no? (2-4-11) 
Let us apply what we have just learned to a very simple problem. 


We will calculate the force between two charged conducting plates, each 
of area A and separated by a distance d (see Fig. 2-6). We will assume the 


2a 


Or» 


Fig. 2-6 Twoconducting plates of area A are separated 

by a distance d as shown. A charge Q, is placed on one 

and a charge Q. on the other. We would like to find 
Q: Q, the total force that one plate exerts on the other. 
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lateral dimensions of each plate to be large compared with d and the thick- 
ness to be small compared with d. We will also assume a charge Q, on the 
first plate and a charge Q, on the second plate. 

Referring to Fig. 2-6, we have taken the charge densities on the four 
relevant surfaces to be o,,, 01,, 024, and o,,, respectively. Of course, their 
values are subject to the constraint that 


O1g + O14, = <- 
(2-4-12) 
2 
Cag biOgy= Al 
We can next calculate the fields in each conductor in terms of o,,, 64,, 
O24, Fz, and then set these fields equal to zero. Each sheet of charge makes 
its contribution of 2x0 with a sign which depends on whether it is to the 
left or right of the region of interest: 


E in conductor | = 2n0,, — 220,, — 2n0,, — 210, = 
: : F (2-4-13) 
Ein conductor 2 = 210,, + 2101, + 2102, — 2202, = 0 
We can solve the four equations above for the four unknowns, and we 
obtain 


eee Q,+Q, 
la fr eat 
=i 
Ci a ag 21—= : 


To evaluate the force on each plate we make use of Eq. (2-4-11). 
We will take as positive a force to the right and as negative a force to the 
left. 


F, = force on plate 1 = —2n0,,7A + 270,,7A 


= = 200192 (2-4-15) 
A 
F, = force on plate 2 = —2102,7A + 2102,’A 
= 7212 (2-4-16) 


Just as we expected, we find the forces to be equal and opposite. The mag- 
nitude of the force, to the approximation used here, does not depend on d. 
Obviously, when d becomes large and of the same order as the transverse 
dimensions of the plates, this approximation will no longer hold. 


42 PRINCIPLES OF ELECTROSTATICS 


Incidentally, the electric field between the plates is given by 420,, or 
—4104,. 


Bees plates — sa (2-4- | 7) 


We will return to this rather simple example again as we develop more 
sophisticated techniques for evaluating the forces on collections of charges. 
In the meantime, we go on to a detailed study of techniques for determining 
the potential function ¢. 


2-5 SOME IMPORTANT THEOREMS ABOUT POTENTIAL 
FUNCTIONS; BOUNDARY CONDITIONS AND UNIQUENESS 


As we have already pointed out, finding the potential g(r) is a convenient 
way of determining the field. The fundamental differential equation obeyed 
by the function g can be obtained from Eq. (2-2-3) by substituting —Vo 
for E. 

62 9 ang 9 62 Q 
aa t Bet geo nae (2-5-1) 
(The operator V? = 07/dx? + d7/dy? + 6?/dz? is called the Laplacian.) 
We will now demonstrate a remarkable set of theorems which relate to the 
uniqueness of our solutions for @ subject to Eq. (2-5-1) and appropriate 
boundary conditions. 

As the first step along the way, we prove a simple lemma called the 
mean value theorem. The theorem states that in a charge-free region the 
average value of potential on the surface of any hypothetical sphere is equal 
to the value of the potential at the center of the sphere. 

Let our spherical surface have radius a. Let @ be the average potential 
on the surface. Then we have 


2 dA H 
= = % 4na” = 4n ke. ‘a (2-5-2) 


where dQ is the element of solid angle subtended by dA. Differentiating 
@ with respect to a, we obtain 


Vo= 


dp 1 dp 1 

da re 4n = éa a = 4n lanl ig = oe 
Substituting back again for dQ, we rewrite Eq. (2-5-3) as 

dp 1 1 ‘5 

ie = Vo -fidA = Bed I . V*g dV (2-5-4) 


of sphere 


Since V7@ = 0 in a charge-free region, we have d@/da = 0, and hence @ 
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does not depend on the radius of the sphere. Thus we have proven our 
theorem. 


Pcenter = Gla a 0) = G(a) (2-5-5) 


There is an obvious byproduct of this theorem, called Earnshaw’s theorem, 
which states that g cannot have a local maximum or minimum in a charge- 
free region. The proof of this theorem requires only that we take a small 
sphere about our presumed maximum or minimum. The potential on the 
surface of the sphere will presumably always be less than (or more than) 
its value at the center, violating the mean value theorem. 

We shall note here that the two theorems we have just proved relate 
to any function f(x,y,z) which satisfies the equation V7f = 0. We will make 
use of this fact in our further work. 

We next examine a situation in which we have a set of N conductors 
held at potentials g,, @2,..., My. In addition we have a charge density 
p(r) specified for all points outside the conductors. We will show that 
there is one and only one solution to the differential equation (2-5-1) which 
goes to zero at infinity and which is equal to the appropriate potential on 
each of the N conductors. 

One solution to the problem is, of course, given by Eq. (2-4-8), where 
we take care to integrate over the surface charge distributions on all the 
conductors. That this is the only solution is not so obvious. 

Let ¢,(r) and ¢,(r) be two solutions which satisfy the boundary 
conditions and the differential equation (2-5-1). Then, letting f = 9, — 9,, 
we have 


Vf = V*o0, — V’o, = —4np + 4np = 0 


Therefore f has no minima or maxima anywhere. Since f is zero on the 
surface of every conductor and also at oo, it must be zero everywhere. 
Thus g, = @, and our theorem is proved. 

Instead of fixing the potentials at each conductor, we might set each 
total charge to a definite value. Thus the NW conductors would have charges 
Q,,Q2,...; Qn, respectively. In addition the charge density outside the 
conductors is again taken as p(r). We will demonstrate that the solution 
for g which satisfies Eq. (2-5-1) and leads to the appropriate values of Q; 
is unique. This is a very powerful result since it implies that the charge Q; 
will arrange itself in a unique way on the ith conductor. Again the proof 
is carried forth by first assuming that g, and g, are both solutions to our 
problem and then examining f = 9, — 9}. 

We remember that the field at the surface of a conductor is just equal 
to 4ncfi. Hence we have 


0, = | o, dA, = = | Vo: fdA, (2-5-6) 
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If both g, and 9, satisfy Eq. (2-5-6), then f satisfies the equation 
| Vf- tid A; = 0 (2-5-7) 


Just as before, V?f = 0 outside the conductor and f = 0 at oo. Again 
Earnshaw’s theorem requires that f have no maximum or minimum outside 
the conductor. 

Suppose for a moment that f is not zero on every conductor. Then there 
is one conductor for which f is either the most negative or the most positive. 
Since no minimum or maximum can exist outside the conductors, this 
particular conductor must be an absolute extremum. Hence V/- fi must 
have the same positive or negative sign at all points on the surface and 
Eq. (2-5-7) would be contradicted. We conclude that f must be zero on all 
the conductors. Again this leads us to the result that f = 0 everywhere. 

Now, of what use are these uniqueness theorems anyway? The answer 
is very simple. There are many general techniques for solving Eq. (2-5-1) 
subject to specific and complete boundary conditions. Once a solution 
has been found which satisfies the complete set of boundary conditions, 
our job is done. We need to look no further; we have the only possible 
solution to the physical problem at hand. Furthermore, if by a bit of clever- 
ness we can see the answer quickly, all the more power to us. 

To see the possibilities inherent in this approach let us solve a problem 
which would seem well nigh impossible if handled in a routine way. We 
place a large sheet of conductor on the yz plane and ground it (g = 0 for 
the conductor). We then bring a charge q to the point x = a, along the x 
axis (see Fig. 2-7). We would like to find the field at all points with x => 0, 
and we would like to know how much force is exerted on the charge. 


g=0 


Fig. 2-7 A charge q is placed at a distance 
x=0 a from a grounded plane. 
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What exactly are the requirements that must be met by ¢ in the region 
x 2 0? First, V?@ must be zero everywhere except at the charge itself. 
Second, g = 0 at x = 0 and at x = oo. 

Suppose we were to examine an entirely different problem, where the 
conducting sheet was removed and replaced by a charge —g on the x axis 
at the point x = —a. This problem would be simple to solve. But the most 
obvious fact about the new problem is that the potential g is zero at x = 0 
and corresponds to the same charge distribution for x > 0 as we had 
previously. Hence it is exactly the potential we seek. The field in the region 
x 2 0 is then also the same for the two problems. Finally we conclude 
that the force on a charge qg at a distance a from a grounded plane is just 
q’/4a* toward the plane. This technique for finding ¢ is called the method 
of images. 

We will very shortly come back to potential theory and develop some 
techniques for determining potentials. In the meantime, though, we go 
back to charge distributions and the nature of matter to investigate the 
notion of electric dipole moment and polarization. 
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Most matter is electrically neutral but with the property that an applied 
electric field can cause some polarization. This can happen in one of two 
ways. If the atoms are basically symmetrical, then they can be unbalanced 
or polarized to the extent that the applied electric field can compete with the 
atomic fields. Alternatively, the molecules themselves may be constructed 
in such a way that they are not electrically symmetrical. In that case, if the 
molecules are free to rotate, as in a liquid, they can be “lined up” by the 
applied field. The extent of alignment will then depend upon the temperature 
because the molecules are continually being depolarized by collisions. 
Figure 2-8 illustrates these two types of behavior. 

In any case, we deal with the field-distribution characteristic of the 
electric dipole. Let us begin by examining the field due to two equal and 
opposite point charges set a small distance / apart. We draw a vector I 
from the negative to the positive charge, as shown in Fig. 2-9, and let r 
be the vector from the negative charge to the point P at which we wish to 
evaluate the field. We let r’ be the vector from +g to P. We will assume 
throughout that |/| < r and hence that terms of the order of /?/r? can be 
ignored. Clearly, we can now write 


1 1 
or) = 4 G = +) (2-6-1) 
Now 
a (2-6-2) 
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Symmetrical Asymmetrical 
atoms dipoles 


No field =) 
= a5 


Applied at uF 
field 6) OF 


Fig. 2-8 The application of an electric field will polarize matter by either polarizing the 
molecular charge distributions or aligning the permanent molecular dipoles. 


and hence 


r=/P+P—2-l (2-6-3) 


Ignoring terms of order /?/r?, we rewrite Eq. (2-6-3) as 


, 2r- 1 
r= Wie 2 
1 
ay ( = ) (2-6-4) 


Substituting back into Eq. (2-6-1) and remembering that 
r-] 1 


1 - —- = ——— 
r? l+(r-I/r’ 
we find 
+ql-r 
g(r) = 5 (2-6-5) 


We now define a new vector p which we call the dipole moment of this system. 
p=a (2-6-6) 
We finally have then 


g(r) = —p: v— (2-6-7) 
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[2 
r’ 
+4q 
r 
l . 

Fig. 2-9 We wish to evaluate 
the field due to two closely spaced, 

—4q equal, and opposite charges. 


The important thing to note here is that the potential drops off as 
1/r?, and hence the field drops off as 1/r°. It is interesting to examine the 
field distributions qualitatively. We note first that @ = 0 for all points in 
the equatorial plane about p (such that p-r = 0). Above this plane, in 
the direction of p, g is positive. Below this plane ¢ is negative. Hence, in 
the equatorial plane the gradient of g must be in the direction of p and 
the direction of E must be opposite to p. Just along the line containing p 
we find the field in the same direction as p. A picture of this field distribution 
is shown in Fig. 2-10. 

We next examine the more realistic problem of a spatially confined 
charge distribution given by p(r’) such that the net charge of the distribu- 
tion is zero. An example of this is, of course, the atomic system. We are 


Fig. 2-10 The field distribution due to an electric dipole p. 
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again interested in the field at a distance from the charge distribution which 
is large compared with its size. We shall show that to first approximation 
this field looks like a dipole field. 

The charge distribution is shown in Fig. 2-11. The potential g(r) at 
position r is given by Eq. (2-4-8): 


g(r) = | a (2-6-8) 
charge 


distribution 


To first approximation, noting Eq. (2-6-4), we have 
1 a 
Lo - (1 pik ‘) (2-6-9) 


Substituting back in Eq. (2-6-8), we have 


g(r) = +| p(t’) dV’ + | pl’) dV’ (2-6-10) 
r charge r charge 


distribution distribution 


Remembering that the net charge is zero, we remove the first term on the 
right of Eq. (2-6-10), leaving us with the old result for a simple dipole: 


1 
= —-p:-V— 
g(r) Pp . 


where 


p= | pr’)r’ dV" (2-6-11) 
charge 


distribution 


We see that any “neutral”? charge distribution can be represented, to first 
approximation, by a dipole whose dipole moment is given by Eq. (2-6-11). 
That this integral reduces to Eq. (2-6-6) in the case of two equal and opposite 
point charges is obvious. 


Fig. 2-11 We integrate over a 
charge distribution with no net 
charge to find the dipole field at 
large distances. 
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Now, if we look into a bit of matter, we see immense numbers of these 
little dipoles, and it becomes necessary to talk about dipole moment per 
unit volume. We define the vector field P(r) such that for a little bit of volume 
dV at position r the dipole moment dp will be given by 


dp = PdV (2-6-12) 


We now wish to find the potential g(r) which results from a distribu- 
tion of free charge p,(r) and dipole moment per unit volume P(r). By free 
charge we mean the charge which is not paired off as part of a dipole. Using 
Eqs. (2-4-8) and (2-6-7), we write 


prt’) dV’ | 1 , 
r) = a = P(r’): V——— av (2-6-13) 
of) \, [r = r’| all Ir ae 
space 
Remembering again that 
ee eget 
lr—r jr — ¥’ 
we have 
2 Vv’ P 7 
g(r) = | tlw “ | Vv’ F (r dV’ 
all Ir mul | all Ir Sr 
space 
zs | Laer Ue) dV’ (2-6-14) 
all Ir Salk | 


The second term on the right-hand side of Eq. (2-6-14) can be transformed 
into a surface integral at infinity. Since P = 0 at infinity, we can write 


eee =| El): 04 di (2-6-15) 
all Ir Sly | surface |r a 


space 


Hence we write 


prt’) : | ppt’) : 
= — dV’ + —_ dV (2-6-16) 
on) {, lr —r'| all t— 
space 
where 
ee (2-6-17) 


We conclude that the entire effect of polarization can be taken into account 
by introducing a polarization charge equal to —V- P and calculating as 
though this were a real charge. 
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Wecan now go back to our basic differential equation for E, Eq. (2-2-3), 
and separate p into its two parts. 


~-V-E = 4p 
= 4n( pr + pp) 
= 4n(p, — V- P) (2-6-18) 
Rearranging terms, we have 
V-(E + 4xP) = 429; (2-6-19) 
We define a new vector field D everywhere in space by the equation 
D = E+ 42P (2-6-20) 


Substituting back into Eq. (2-6-19), we obtain the appropriate Maxwell 
equation in the presence of matter. 


V-D = 420; (2-6-21) 


We conclude then that the free charge density p; can be regarded as a local 
source of D in the same way as the total charge density p can be regarded 
as a source of E. That is where the similarity ends, however. Specifying the 
total charge density everywhere determines E completely by means of 
Eq. (2-1-4). Specifying p- does not determine D completely. For example, 
a block of permanently polarized material with no free charge can certainly 
give rise to an electric field near it. Hence we would have a D field in the 
complete absence of any pp. The magnitude of D would depend on the 
magnitude and distribution of P. 

Returning to Eq. (2-6-20), we note that there is often a linear relation- 
ship between the electric field at a given point and the polarization at that 
point. We can then write 


P = yE (2-6-22) 


where x, called the electric susceptibility, is in general a second-rank tensor. 
In the simple-minded case where x is just a number, it is usual to write 


D = &E (2-6-23) 
where é, called the dielectric constant, is given by 
eé=1 + 4nz (2-6-24) 


We can establish some very simple rules governing the behavior of 
the E and D fields at the boundary between two regions of differing di- 
electric constant. We assume that no free charge is present at the boundary. 
Figure 2-12 shows a portion of such a boundary between regions I and II, 
respectively. 
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II 


Fig. 2-12 We determine the behavior of E and D at the 
interface between two dielectrics. 


_— 
——— 


7 


We first place a flat thin volume across the boundary and apply 
Gauss’ theorem. Since there is no free charge within the volume, the flux 
of D out of the volume is zero. The normal components of D on either side 
of the boundary are thus equal. 


D,:fA =D, fi (2-6-25) 


We then take a path of integration shown and evaluate the line integral 
J E-dl around the path. Since it must be equal to zero, we conclude that 
the tangential components of E must be the same on either side of the 
boundary. 


a = ae (2-6-26) 


These rules will make it possible to calculate the direction and magnitudes 
of E,, and D,, from the direction and magnitudes of E, and D, if we are given 
the two dielectric constants. 


2-7 THE ENERGY OF A CHARGE DISTRIBUTION 


We will now determine the energy of an electrostatic charge distribution. 
In doing so, we will at first ignore the energy corresponding to the forma- 
tion of the dielectrics themselves but evaluate only the energy which is 
added to the system through the introduction of the free charge. The energy 
of the system will be considered to be zero when all free charge has been 
removed to infinity and only the dielectrics and conductors are left in 
position. All polarizations will be assumed proportional to the applied 
fields. 
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Let us take p, and ¢ to be the final distributions of free charge and 
potential, respectively, after our system has been completely set up. If 
we had somehow decided to place a factor of a as much charge everywhere, 
then our potential would be changed by the same factor a. That is to say, 
if the charge distribution p, leads to the potential distribution g, then the 
charge distribution ap, would lead to the potential distribution ag. (We 
must repeat that this depends upon the assumed Jinearity of the dielectrics.) 

Now let us bring in the charge from infinity, a little bit at a time. 
That is, let « go from 0 to 1, in infinitesimal steps. As we go from ato « + da, 
we have 


dU = increase in energy 


= | (ap)(p da) dV (2-7-1) 
all 


space 


since a@ is the potential to which the additional bit of charge p, dx dV 
is brought. The final energy is then 


1 
u= | ade | Opp dV 
0 all 
1 


space 


= Prp dv (2-7-2) 
all 
space 
If we have conductors present, then ¢ is a constant within their boundaries 
and we have 


1 1 
Ces. prod + — ¥ 9:9: (2-7-3) 
quiside cond 


where Q; is the charge of the ith conductor. 
We can rewrite U in a somewhat different form. Let us remember 
that V- D = 4zp,. Hence, going back to Eq. (2-7-2), we can write 


1 1 
U = — pro dV = — (V - D)g dV 
2 Jan 8x Jan 
space space 
= — [V -(@D) — Vo: D] adv 
87 Jan 


space 


I 1 
al gD-idA+—| D-Eav (2-7-4) 
8x Sea all 
space 
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where S,, is a surface at infinity. But D goes down as 1/r? and @ goes down 
as 1/r. Hence 


oD-fidA > 0 asr— co 


JSa 
We are then left with 
1 
Uses D-E ae 
ln dV (2-7-5) 


space 


At this point we might ask what the expression for energy would 
look like if we really knew what was going on in the material and chose to 
include the energy of formation of the dielectric into our tally. We would 
then have no need to introduce D at all but would use only the total charge 
density p and the electric field E. The energy of the system would then be 
written simply as 


1 
Urorat = Sa IF E? dV (2-7-6) 
ciace 
or 
1 
Usotat = De {, Pe dV (2-7-7) 


space 


In general, Eq. (2-7-6) will give a smaller value for the energy than Eq. 
(2-7-5) because D is usually larger than E. This is because the dielectric 
itself has a net negative energy corresponding to the binding of negative 
and positive charges. 

We now apply these results to a very simple example. We will calculate 
the energy stored in a parallel-plate capacitor, with no dielectric present, 
and then make use of this to determine the force between the plates (see 
Fig. 2-13). We assume a charge of + Q on one plate and — Q on the other. 


Fig. 2-13 We calculate the energy stored 
i in a parallel-plate capacitor. We again find 
the force between capacitor plates but this 
—Q time by considering the change in energy as 
we reduce / while keeping Q constant. 
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The area of the plates is A and their separation is /. By our first method 
we have 


1 1 


where Ag is the potential difference between the plates. 


4nQl 
we Bae 
A 
Hence 
2 
U= a t (2-7-8) 


By our second method 


uae 1 (4nQ\? 2nQ71 
== == | 
U Bn E* (volume between plates) al 74 ) A A 
We note that the stored energy decreases as the separation between plates 
decreases. Since all the stored energy is in the gap, this tells us that the 
force between plates is 


(2-7-9) 


with the minus sign indicating that the force tends to decrease /. This is 
in agreement with the result obtained earlier [see Eq. (2-4-15)]. If our 
plates were attached to a battery (Ag constant), the situation would be 
somewhat different. The energy between the plates would be 


__1 (Ae) ,, _ (Ag)?A 
u~3-(*) ‘ake: 8x1 


Here the energy stored between the plates increases as the gap decreases. 
Indeed, the change in electrostatic energy for a given small displacement 
is equal and opposite to that given by Eq. (2-7-9). Obviously the total energy 
stored in the system must still decrease as before, when / is decreased (since 
the force is just as attractive as before). Thus the battery runs down by an 
amount equal to twice the mechanical work done by the fields. 


2-8 THE GENERAL THEORY OF CAPACITANCE 


We will investigate the notion of capacitance in a rather general way, 
beginning with one or two conductors but then extending ourselves to an 
indefinite number of conductors. This will give us a tremendous amount of 
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insight into the linear nature of electromagnetism in addition to providing 
powerful techniques for the solution of specific problems. 

We start with one of the most important of all principles in physics 
and one which we will encounter many times when we deal with quantum- 
mechanical systems. This is the superposition principle. Within the present 
context the principle states that if a charge distribution p, leads to a potential 
distribution g, and a charge distribution pz leads to a potential distribution 
(zp, then the charge distribution p, + pz will lead to the potential distribu- 
tion @, + gg. (We must be careful here to use the entire charge density 
if we want to be rigorously correct. Otherwise nonlinearities in the dielectric 
will lead us into difficulty.) We need only go back to Eq. (2-4-8) to see that 
this principle is obviously true. The value of this principle will become 
apparent in cases where the solutions to V?9, = —4zp, and V7, = 
—4np, are easier to come by than the solution to V?9 = —4n(p, + pp)- 
We will then solve for g, and g, and add these solutions together to obtain 
Q. 

Let us consider first the case of only one conductor in space and 
nothing else (no other charge). If we place an amount of charge Qy on the 
conductor, its potential will rise to some value go. An amount of charge 
“Q, would lead to a potential apy. Clearly then there is a proportionality 
between the charge on the conductor and its potential which we can express 
as 


QO = C@cona (2-8-1) 


The proportionality constant C is defined to be the capacitance of a single 
conductor. In the case of a conducting sphere of radius R, where the poten- 
tial is just given by 

Psphere a z (2-8-2) 
we see that the capacitance is just R. 

It is amusing at this point to calculate the capacitance of the earth. 
Its radius is roughly 6500 km or 6.5 x 10% cm. This is not a very large 
number. Indeed, converting to “practical” units, we would find it to be 
equal to only about 700 microfarads. Alternatively, we can observe that 
if we removed the electrons from only a gram or so of material, we could 
raise the potential of the earth by 140 million volts. This in turn would lead 
to an electric field at the surface of about 0.2 volt/cm. Clearly then the 
positive and negative charges on the earth are very well balanced indeed, 
and it takes rather little to disturb this balance. That is why the flood of 
charged particles coming from a solar flare plays such havoc with radio 
communications on earth. 

We next consider the case of two conductors where one has charge 
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+Q and the other has charge —Q (see Fig. 2-14). Let g, be the potential 
on conductor 1 and @, be the potential on conductor 2. If we multiply Q 
by a constant factor, then both g, and @, will be multiplied by the same 
factor. Hence g, — @, is proportional to Q; that is, Q = C(g, — @2). 
The proportionality constant C is the capacitance of a two-conductor 
system. 

It is clear that the linearity of electrostatics permits us to specify 
important electrostatic properties of conductors which are quite indepen- 
dent of their charges cr potentials but depend only on the geometry of the 
system. We generalize now to a system of N conductors and no extraneous 
charge. Let the charges on the conductors be Q,, Q,, Q3, -..,0;,..%, 
Oy. The potentials will then be @,, @2,...,Q;,..., Oy. We seek a way 
of specifying Q;,..., Oy in terms of g,,..., @y, dependent only upon 
the geometry of the situation. 

We begin by solving a subsidiary problem. Let us ground all conductors 
except the first one. That is, let g; = 0 for i # 1. For this first subsidiary 
problem we will have a charge Q,(1) on the ith conductor. If every charge 
were to be multiplied by a factor a, then every potential would be multiplied 
by the same factor «. All those for i + 1 would, of course, remain zero. 
The potential on the first conductor would become equal to ag,. Hence 
we can write 


Q,(01) = Cii1¢ 
Q,(1) = Cr 
Q3(1) = C3191 


Ce 


On(1) = Cyr (2-8-3) 


+Q 


Fig. 2-14 Two conductors, one with 
-@Q charge +Q and one with charge —Q. 
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We now solve a second subsidiary problem. We ground all conductors but 
the second and set the second to g,. We have then 


Q,(2) = Cy292 
Q,(2) = Cr22 


On(2) = Cy2@2 (2-8-4) 


It is clear that we can go on like this and solve a total of N such subsidiary 
problems. 

We now recall the superposition principle. If we add together the 
solutions to subsidiary problems 1 to N, we will have the solution to the 
problem where each conductor i is raised to its appropriate potential 9,. 
Thus 


Qi = Cy 91 + Crag. ++ °° + Cyn Qn 
or 
N 
QO; = Sy CQ; (2-8-5) 
j=l 


The numbers C;; (called coefficients of capacitance) tell us what we want to 
know about the geometry of the situation. 

In physical terms then, if we can determine the full set of C,, for any 
given conductor configuration, we will be prepared to determine immediately 
the full set of charges Q,,..., Qy from the given set of potentials ,, 

..,@y. Of course, if we are given the charges Q,,...., Qy, we can 
determine the potentials by means of N simultaneous equations. We now 
wish to prove that C;; = C;;. To do this, we will first prove a more powerful 
theorem. 

Green’s Reciprocity Theorem. Let charges Q,,Q2,..., Qy correspond 
to potentials @,, ~2,..., @y. Let charges Qj, Q3,..., Qy correspond to 
potentials 9}, 93,..., Oy. Then 


ps 9:0; = ps 9 Q: (2-8-6) 
Proof: We first note that 
Q; = | 6;(r;) dA; (2-8-7) 
3; 


Q; = | a; (r;) dA; (2-8-8) 
S; 
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Using Eq. (2-4-8), we derive 


or) = » ey 


Qj = , 
j=0 JS; re : 
= constant depending only oni (2-8-9) 
: o;(r;) ; 

9; = Oo) = by ar dA’, (2-8-10) 

S; | i j 

We have then 
0:9; = | 6;(r;) a4, fot (2-8-11) 
JSi 


But @; is independent of r; as long as r; is at a point on the conductor. Hence 
we can bring it into the integral 


0:9; = | o,(r;) 9; (r;) dA, (2-8-12) 
S; 
and 
oors a | SDD ay a, (2-8-13) 
i 1 i eer 
Also, similarly 
ee 7 
t J j i 


We can now interchange the dummy indices i and j as well as the primed and 
unprimed variables. We see then that the two expressions are identical. Thus 


N N 
x 9iQ; = x 99; ; (2-8-15) 


To prove that C;; = C;;, we take as our two cases: 
1. Starred case: Let 
9; = ~~ allother g, = 0 
Then 
Q; = CijPo 
2. Unstarred case: Let 


Q; = Po all other g; = 0 
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Then 
QO; = CjiPo 
Green’s reciprocity theorem then tells us 
2, Xi = 059; = Qo = CiiPo* =) QO, = 0; 9; a CijPo7 


Hence 
= (oF (2-8-16) 


We observe one of the physical consequences of this statement in 
the case of two conductors. If conductor | is placed at potential g) and 
conductor 2 is grounded, a certain charge Qy will be induced on the latter. 
If, on the other hand, conductor 2 is placed at potential gy and conductor 1 
is grounded, the same charge Qy will now appear on conductor 1. 

It is instructive to calculate explicitly that this is so in the case of two 
concentric conducting spherical shells, one of radius a and the other of 
radius b (a < 5). 

Situation 1. Inner sphere a is at potential @ . Outer sphere 6 is at 
potential 0. Let Q, = charge on inner conductor. Then we calculate the 
potential on the inner conductor in the usual manner: 


b 
= 0 ,... NOm 1; 
v= | Ga-2 b 


Solving for Q,, we have 


ab 
OP ae b —a Po 
Since E = 0 outside the outer sphere, we must have Q, = —Q, and 
thus 
onan” (2-8-17) 
a Po 


Situation 2. Inner sphere a is grounded. Outer sphere b is at potential 
(Qo. The potential at r = a is equal to 0. This means that we can write 


b 
Q, 
| > dr + Qy = 0 


0,=—— 96 (28-18) 


Thus Q, (situation 1) = Q, (situation 2). 
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As we mentioned earlier, we could solve the set of N equations (2-8-5) 
and obtain the g’s in terms of the Q’s. We would then obtain the set of 
equations 


Qi = 5 P,Q; . (2-8-19) 
j= 
where the P,;’s are called the potential coefficients. We can again make use of 
Green’s reciprocity theorem to prove that P,; = P;.. 
1. Unstarred case: Let 
QO; = Q all other QO; = 0 
Then 
Qi = Pj Qo 
2. Starred case: Let 
Q;=Q, allotherQ;=0 
Then 
Q; = P;i Qo 
Substituting into (2-8-15), we have 
2, Qo = 09; = PQ” 
2 Qe = Q;9; = P.;Qo” 
and hence 
1 a (2-8-20) 


The physical content of this statement is quite remarkable. Imagine that 
we have a set of conductors. If we put a charge Q, on the fourth one, for 
example, and no charge on any of the others, we will produce a certain 
potential y, on the seventh one (for example). If the same charge 0, were 
put on the seventh one instead (all others uncharged), the potential on the 
fourth one would now be gs . Thus there is a profound, reciprocal relation- 
ship among the charges and potentials on all the conductors. 

Making use of Eq. (2-7-3), we have for the energy of a set of charged 
conductors 


] 
U=>) 0; 


1 
=5 1 Ciei9; (2-8-21) 
tJ 
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1 : 

U= oy » Pj Q; Q; (2-8-22) 

We can prove a rather nice theorem now. Let 6Ug be the change in 
electrostatic energy of the system if the conductors are slightly displaced 
while the charges are held constant. Let 6U, be the change in electrostatic 
energy of the system of conductors if their displacement is the same as 
before, while the potentials are held constant. Then we will prove that 
6Ug = —dU,. 

First we treat the constant-potentia! case: 


6U, = 10(¥ co.) 


= y (6C;,) 9:9; (2-8-23) 


since ~;, p; remained fixed. To treat the case of fixed charge, we note that 
_ in this case 


60; =0= a( Cyo,) 
J 
= De (0C;) 9; + aD C59; 
j j 


Hence 
Y C69; = —Y 6C,)9; (2-8-24) 
i j 


Now for the fixed-charge case 


l 1 
6Ug = > Ds (5C;,)9:9; + ze C (59) 9; 
ij \ 


1 
5 > 2S C;9(9@)) (2-8-25) 
ij 
We first interchange the dummy indices i and j in the last term on the right 


side of Eq. (2-8-25). We then remember that C;; = C,;. Thus the last two 
terms on the right side of Eq. (2-8-25) are equal, and we can write 


1 
6U9 = > ¥ (6C;)¢:9; + ¥ C,;(60)¢; (2-8-26) 
i,j ij 
Using Eq. (2-8-24), we have 


1 
6Ug = > > (6C;) 9:9; — 2, OC), (246-27) 
ij ie 
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6Ug = 6U, — 26U, = —6U, (2-8-28) 


Our theorem is proved. 

The physical consequences of this theorem are quite interesting. 
Imagine that we have a given arrangement of conductors with charges and 
potential on them. If the conductors are isolated and unconnected to the 
outside world, then the charges will remain fixed in any displacement. If 
6Ug is positive, then we must do mechanical work of magnitude 6 Ug against 
the electrostatic forces to perform the displacement. Now if, on the other 
hand, the conductors were connected to a set of batteries which maintained 
the potentials fixed, the change in electrostatic energy for the same displace- 
ment would be 6U,. Since the forces are the same as before, depending only 
upon the charges and positions of the conductors, the mechanical work 
that we do to carry out the displacement is exactly the same as before (6Ug). 
Since a positive 6Ug implies an equal negative 6U,, we must be putting an 
amount of energy 26Ug into the battery when we perform the displacement. 

An example of this was discussed earlier in the case of a parallel- 
plate capacitor (see page 54). 
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As we have already surmised, much of our study of electrostatics must be 
devoted to the development of techniques for the solution of Laplace’s 
equation (2-5-1) subject to appropriate boundary conditions. In general 
these boundary conditions will include the specification of either the total 
charge or the potential on each conductor. The potential at infinity will 
be assumed to be zero. 

We will soon see that only a limited number of problems can be 
solved in closed form or in terms of a simple power series. These are, in 
general, problems that exhibit a great deal of symmetry about a point or 
a line and hence call naturally for the use of cylindrical or spherical co- 
ordinates. Accordingly we begin by developing the important differential 
forms (gradient, curl, and divergence) in these coordinate systems. 

The cylindrical coordinate system, illustrated in Fig. 2-15, makes 
use of the three variables r, 0, z to specify the location of a point in space. 
The coordinate r is just the distance from the point to the z axis. The angle 0 
is the projected azimuthal angle as measured from the x axis and z is, of 
course, identical to the usual z coordinate. The coordinates (r,6,z) and the 
old coordinates (x,y,z) are related through the transformation equations 


r=./x*>+y* or x=rcosé 


(2-9-1) 
tan 0 = y/x or y=rsin@ 
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x 


Fig. 2-15 Thecylindrical coordinate system makes use of the variables r, 
6, z to specify the location of a point in space. 


It is convenient to set up three unit vectors f, 6, and k at a point of 
interest. The unit vector f is drawn in the direction of increasing r, and the 
unit vector @ is drawn in the direction of increasing 6, and naturally the 
vector k is drawn in the direction of increasing z. The three form an orthog- 
onal coordinate system with f x 6 = k. 

We note immediately that the new triad of unit vectors can be related 
to the old triad i, j, and k through the transformations 


f = cos 0i+ sin Oj or i = cos Of — sin 96 
6=-sindi+cosOj or j=sinOf+cos06 
k=k (2-9-2) 


Proceeding first to find an expression for V¢ in this coordinate system, 
we write, using Eq. (2-9-2), 


_ 99; 09 3 0@ + 


ieee a5! > aa 
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0g op. 0p 0p, A 
(2 cos 8 + By sin a)e + & cos 6 Ag sin 


+ 2% 29-3) 
Oz 


We remember though that dg/0x and d@/dy can be rewritten as 
6p _ 0p Or dp 080 0g Oz 


dx or ox 00 dx @z Ox 


(2-9-4) 
ae ee ae 
Oy ~ O@r Oy 06 dy dz oy 
Using Eq. (2-9-1), we find that 
or 00 —sin 6 
—— = cos 0 = = 
Ox r 
ot wien, “SOS geen? (2-9-5) 
0 oy r 
ae ae 
Ox Oy 
If we substitute these results back into Eq. (2-9-3), we finally have 
dy 1 (00 \, 0 \. 
= (—— —|{— =< 2-9-6 
vo (Ser + (Bho + (Fe (2.9.6) 


The identical result can be obtained immediately if we think of the 
meaning of V@ in physical terms. The three components of Vg are the 
rates of change of ¢ with respect to distance in the three mutually perpendicu- 
lar directions given by f, 6, and k. Going along f first, we noted that the 
rate of change of g with respect to distance in that direction is just 0@/dr. 
Going along 6, we see that a differential distance in this direction is just 
r dQ. Hence the rate of change of @ with respect to distance in the @ direction 


7] ; ; : 
= Finally, the rate of change of g with respect to distance 


in the z direction is 6g/dz. Equation (2-9-6) is then confirmed directly. 
We next consider the divergence of a vector function F as expressed 
in cylindrical coordinates. We could begin with cartesian coordinates and 
convert all derivatives appropriately. It is more elegant, however, to just 
calculate the flux out of a small volume element dV and set the result equal 
to V- F dV. As our volume element dV, we choose that which is bounded 
by the coordinates r and r + dr, @ and 0 + d@, z and z + dz, as shown in 


1 
is given by -s 
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Fig. 2-16. We indicate the bounding surfaces of dV by the numbers () to ©. 
Surface ( is at radius r and is bounded by @, 6 + d6, z, and z + dz. 
Surface @) is at radius r + dr and is bounded by 0, 6 + dé, z, and 

z+ dz. 

Surface @ is at height z and is bounded by r, r + dr, 0, and 6 + dé. 
Surface @ is at height z + dz and is bounded by r, r + dr, 0, and 

6 + dé. 

Surface 6) is at angle 6 and is bounded by z, z + dz, r, and r + dr. 
Surface @ is at angle 9 + d@ and is bounded by z,z + dz,r,andr + dr. 
The areas of the various surfaces are 


dAg@ = rd0dz dAg = dA@w = rdrdé 
dA g = (r + dr)d6 dz dAg = dA@ = drdz 


(2-9-7) 


The fluxes out of the volume are then 
Flux out through @ = — F(1)r dé dz 
Flux out through @ = + (2)(r + dr)d6 dz 
Flux out through @) = — F,(3)r dr d@ (2-9-8) 
Flux out through @ = +£,(4)r dr dé 
Flux out through 6) = —&(5)dr dz 
Flux out through © = +(6)dr dz 


Fig. 2-16 We wish to find the 
flux out of an infinitesimal vol- 
ume dV shown above. 
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Remembering that _ 


Ho) — Fay oF dr 


B(4) — FG) = 5 de 


FA(6) — F(S) = <2 ao 


we have (ignoring all terms which are the product of four differentials) 


Net flux out = of 
+ (+ Fe) rar ad de (2-9-9) 
r 060 
The volume dV is just r dr dO dz. We can then rewrite Eq. (2-9-9) into the 
form 
1 o luck, .. OF, 
ee ae -9-10 
Net flux out F a (rF, , 30 ae :Jav (2-9-10) 
The divergence of F is thus given by 
1 1 — OF, 
Veni =e a) + — 30 + ae (2-9-11) 


Inserting our expression for Vg [Eq. (2-9-6) into the above, we find that the 
laplacian is given by 


1 afe 1 ao 2 
V0 =V-Vo = ~ x? =) oS sat “ = (2-9-12) 


We next find the three components of V x F in cylindrical coordinates. 
To do so we make use of Stokes’ theorem and evaluate the line integral of 
F -dl around three infinitesimal loops, one perpendicular to f, one per- 
pendicular to 6, and one perpendicular to k. We take a loop perpendicular 
to f first, bounded by the coordinates 6, 6 + dO, z, and z + dz. We have 
then, applying Stokes’ theorem, 


t-(V x F)rd0 dz = [F,(z) — F(z + dz)] rdo 
+ [F,(0 + d0) — F,(6)] dz 


-(+$- *), re 


r 06 oz 
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Hence 
1 OF, OF 
(Vx F)-f= = aa = a (2-9-13) 


For a loop perpendicular to @ and bounded by r, r + dr, z, and z + dz, 
we have 


6-(V x F)drdz = [F,(r) — F,(r + dr)] dz 
+ [F(z + dz) — F,(z)] dr 


OF OF. 
-( _— 2) dr de 


Oz or 
Hence 
A we OF. 
(V x F)-@= aa — oe (2-9-14) 


Finally we take a loop perpendicular to k. We have then 
(V x F- k)rdr d0 = [F,(0) — F.(60 + d0)] dr 
+ Fj(r + dr)(r + dr) dé — F,(r)r dO 


Oy, Fe 1 OF, 
or r r 00 


) r ar do 
Thus 


Vx Eater 30 (2-9-15) 
Combining Eqs. (2-9-13) to (2-9-15), we have 
1 OF, OF,\,. (dF, OF,\, 
vxr=(+ 30 - Ses (Z- =) 6 
OF, Fe @h eh). 
sz (? es mak (2-9-16) 


We shall return very shortly and make use of cylindrical coordinates 
for the solution of specific problems. In the meantime we turn to the spherical 
coordinate system shown in Fig. 2-17. 

The spherical coordinates of a point are its distance from the origin r 
and its polar and azimuthal angles @ and w. (Note: We use & to denote 
azimuthal angle to avoid confusion with the potential function g.) The 
transformation equations relating x, y, and z to r, 6, and w are given by 


x =rsin @ cosy 
y=rsin@siny (2-9-17) 
z =rcos 6 
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~ 


, ee ee 


x 


Fig. 2-17 In the spherical coordinate system a point is designated by its 
distance from the origin r, its polar angle 6, and its azimuthal angle yp. 


or 


r= fx? ye 


Z 


cs: 6 = ee (2-9-18) 
af x? ye 

x 
ft ag?" 
Again we set up an orthogonal coordinate system at the point of interest. 
The unit vector f points in the direction of increasing r. The unit vectors 
6 and w point, respectively, in the directions of increasing 6 and w. As can 
be seen from Fig. 2-17,# x 6 = w. 

Using the same arguments as before, we evaluate Ve by determining 
the rate of change of g with respect to distance along each of the three 
mutually perpendicular unit vectors. The rate of change of @ with respect 
to distance along f is just dg/ér. The rate of change of @ with respect to 


cos y = 
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eet 
distance along the @ direction is 7 a The rate of change of @ with respect 
Be 
rsin@ dw 
_ 0p R 1 dg j 1 Ogee. 
Or r 00 rsin 6 dw 


to distance in the wy direction is Hence we have 


(2-9-19) 


Using precisely the same techniques as we used in dealing with cylindrical 
coordinates, we can evaluate V- F, V x F, and V7. We obtain 


von = < (PF) a — < (sin 0 F,) + - = 7 = (2-9-20) 
V = sl ge cine) - Fe] 
+ pake ESE M6 asm 
Gee A) aaa 
ee (2-9-22) 


r? sin? 6 dy? 


We now proceed to make some use of all this newly acquired knowledge. 
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Now that we have learned to express Laplace’s equation in three useful 
coordinate systems (cartesian, cylindrical, and spherical), we might give a 
bit of thought to the problem of solving the equation. In general we will 
find ourselves with an infinitude of possible solutions, and we will have 
to refer to the specific boundary conditions at hand to find the correct 
combination. What we hope to do is to make our procedure orderly, 
remembering always that only one solution of Laplace’s equation can 
satisfy the complete set of boundary conditions. 

We start with the simplest system, cartesian coordinates. The pro- 
cedure we follow here will be a quite general one and will be justified more 
thoroughly after we have worked with it some. We will search for solutions 
to V7o = 0 of the form 


p(x,y,z) = X(x) YO)Z(z) (2-10-1) 


Inserting this product form into Laplace’s equation, we obtain, after 
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dividing through by ¢(x,y,z), 
Lex Ley 1 eZ 
X a? Yager Z a 
Since the three variables x, y, and z are all independent, the three parts 
to Eq. (2-10-2) must all be constants. That is, 


0 (2-10-2) 


Ngee Ys 

——> = 2X 

axe” 

a = pry (2-10-3) 
a 

ae 92 

provided that 
a? + B2 + 2 =0 (2-10-4) 


(Needless to say, «, 8, and y cannot all be real.) We have then 


X = Aye™ + Bye™™ 
Ww = A, efy + B, e Fy (2-10-S) 
Zz = A;e” + B, (Se 


The possible solutions of the form given by Eq. (2-10-1) are thus innumer- 
able—every choice of A,, B,, A,, B,, A3, and B, will do. In addition we 
can choose a, f, and y as we like provided that they satisfy the simple con- 
straint given by Eq. (2-10-4). What are we to do next? The answer is best 
given by solving a specific problem in which we apply a set of boundary 
conditions to determine g completely. 

We consider a box made up of six plane sheets as shown in Fig. 2-18. 
The planes at x = 0, y = 0,z = 0,x = a, and y = bare grounded (9 = 0). 
The plane at z = c has a potential distribution on it given by g(x,y). We 
wish to know the potential everywhere within the box. 

We begin by inserting the boundary condition at x = 0 into Eq. 
(2-10-5). Since g(x = 0) must vanish for all y and z, we must have 


X(0) = A, + B, =0 (2-10-6) 


Similarly, making use of the boundary conditions at y = z = 0, we can 
write 


A, + B, = 0 (2-10-7) 
AR 0 (2-10-8) 


If we now go to the wall at x = a, we find 


Aye" + Bie *=0 (2-10-9) 


2-10 SOLVING LAPLACE’S EQUATION IN CARTESIAN COORDINATES 71 


Fig. 2-18 We solve for the 
potential on the inside of a box, 
all the walls of which are 
grounded except for one. That 
wall at z =c has a potential 
distribution on it given by 


Po(x,y). 


In order that Eqs. (2-10-9) and (2-10-6) both hold we must have 


or 
erm = | (2-10-10) 


For Eq. (2-10-10) to be true, we must have 


nit 
= — 2-10-11 
t= — ( ) 
where n is an integer. Combining what we learned and remembering that 
i9  ,—i0 
sin 0 = — (2-10-12) 


we can conclude that 


X(x) oc sin = (2-10-13) 
Similarly 
¥(y) o sin = (2-10-14) 


where m is also an integer. For every choice of integers m and n we have 
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then a possible solution 


Pam(XYs2) = Ay. m SiN — sin = sinh yy, m2 (2-10-15) 
where 
nm mn? 
Yam = a + pz (2-10-16) 


We are now ready to insert the final boundary condition at z = c. 
We have then 
see sin y (2-10-17) 


p(x,y,C) = Po(x,y) = os (A, m Sinh Yume) sin a 


nym=1 
Our job now seems to be reduced to finding the set of coefficients 
A,,m- But, do such a set of coefficients exist? At this point we must state 
without proof the fundamental theorem of Fourier analysis.! Any function 
of x defined over the interval from x = 0 to x = a can be expressed as a 
linear combination of terms of the form sin (mzx/a). That is, for any 
J (x) there exist a set of numbers B, such that 
= . 
s1E3) B, sin — x 
n=1 a 
over the interval from 0 to a. In the case of two dimensions x and y, any 
function f(x,y) defined over the interval 0 < x < a and 0 < y <b can 
be represented as a double Fourier series: 
= _ nm . mn 
f(xy) = YB, Sin — x sin — y (2-10-18) 
n,ym=1 a b 
Returning to Eq. (2-10-17), we find the coefficients A,,,, by a standard 
ln 
b 
are integers. We then integrate over both x and y. Remembering that 


procedure. We multiply each side by sin at x sin — y where both k and / 
a 


a 
nh, Wen a 
| sin — x sin — x dx = — b, 


0 a a 2 
js sin ae ees = pis 1 —— 
0 b bes Pies 
we have 


4 : et. me 
Aum = wa, dx j, dy Qo(x,y) sin > * sin Sy (2-10-20) 


‘For further explanation see Chap. 6. 
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We are thus in a position to evaluate the coefficients A,,,, explicitly, and 
we have solved our problem. 

Let us go back now and say a few words about the key step in our 
procedure for solving Laplace’s equation. We searched for solutions in 
the form of products of functions of the independent variables. We then 
noted that at least in the solution of one simple boundary-value problem 
we could write our answer in terms of a sum over these solutions. The 
remarkable fact is that any solution to Laplace’s equation can be expressed 
as a sum over such product functions and that is why we have lost no 
generality by proceeding in this manner. We will always look for these 
product-function solutions and then spend our time determining the 
coefficients from the boundary conditions. 


2-11 SOLVING LAPLACE’S EQUATION IN CYLINDRICAL COORDINATES 


As we pointed out earlier, most simple potential problems with a great deal 
of symmetry can be handled best by means of cylindrical or spherical co- 
ordinates. In this section we consider the solution of Laplace’s equation 
in cylindrical coordinates subject to appropriate boundary conditions. 
Making use of Eq. (2-9-12), we write 

1 alt 1 ao oo 

Cy pS asaeneaetig by sd ee ee) 2-11-1 

ae ( 4 + 30” oe ( ) 
For simplicity let us begin by limiting our discussion to cases where @ does 
not depend on z. That is to say, 07/2” = 0. We look again for product 
solutions. In this case they have the form 


y(r,0) = R(ry@(0) (2-11-2) 
Our equation becomes 

@ d/ dR\ R #0 _ 

r a\' a) r de 
Let us multiply by r? and divide by RO. We have then 
1 @7o 
— —{r—]=-—-—, 2-11-3 
R dr (: ) © de le) 


Note that the right side of the equation depends only on 6 and the left side 
only on r. Since the two are independent variables, the two sides must 
separately be equal to a constant, which we write as k*, We have then 
20 
— = — k*@ 2-11-4 
do? ( ) 
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and 

r = (: =) = k?R (2-11-5) 
Consider first the case where k + 0. The first equation then has as a solution 

0 = A, cos k6 + B, sin k@ (2-11-6) 
In the event that kK = 0, we have 

© = Ay + 06 (2-11-7) 


Solutions corresponding to imaginary k, that is, negative k?, cannot be 
single valued as we increase @ by 2x. Hence they are excluded. In fact the 
requirement of single valuedness implies that A, = B, = 0 unless k is an 
integer. It also implies that « = 0. Hence we can write in general 


0 = A, cos nO + B, sin nO wherein = 0; 1, 2,... (2-11-8) 
To find R(r) we first expand in a power series: 


R(r) = 3 Cc," (2-11-9) 


7 a 


Substituting back into our differential equation (with k = n + 0), we find 


d/{ dR\ _ ee 
Cen (: =) =) Con’ =Car (2-11-10) 
Hence 
¥ Cy(m? — n2)r" = 0 (2-11-11) 


In order that this be zero, we must have each and every term zero, since 
no power of r can be expressed in terms of any other powers of r. Hence 
either 


or 


iit and ii = srif 


In the event that n = 0, we have 


R=f6Blinr+y (2-11-12) 
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Finally then we can combine all our constants and write the general solution 
as 


g(r,é) = 2 A,r" cos nO + Y Bur sin n8 + se C,r-" cos nO 


n=1 
+ } D,r-"sinn6 + Colnmr + Dy = (2-11-13) 


where all the constants are arbitrary. 

We now apply this to the solution of a simple problem. Let us examine 
the case of an infinitely long, conducting, uncharged cylinder in an electric 
field which approaches uniformity at large distances from the cylinder. 
The field is taken to be perpendicular to the axis of the cylinder. The radius 
of the cylinder is a (see Fig. 2-19). For large x we have py = —E x (we 
obviously must remove the restriction @ = 0 at infinity if we want a constant 
electric field at infinity). 


gp(large x) = —Eprcos 6 (2-11-14) 
The condition that the cylinder be uncharged yields 


2 charge =, 
unit length 
1 OP a4 a *" dp 
en ae ses a dé (2-11-15) 
4n he length Ee 


We expect, by the symmetry inherent in the situation, to find that g = 0 
on the cylinder. Let us assume this to be true and see where we get. Since 
at large r we must have g — —Egrcos@, we can set [see Eq. (2-11-13)] 


Aa () forn + 1 
ii = for all n (2-11-16) 


EE catia 


Fig. 2-19 An infinitely long, 
conducting cylinder is placed in 
Se a uniform field. 
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Now at r = a, g = 0. Hence 


—E,acos@+ ) C,a~" cos n 


n=1 


n 


+ ¥ D,a-"sinn6+ Colna+ Do =0 — forall@ (2-11-17) 


Since all functions cos n@ are linearly independent, we have immediately 


De) for alln 

So a (2-11-18) 
C= forn = 1 

—E,a+C,a ~=0 


Thus 
2-11-19 
Cy — Ea’ ( ) 
We have then 
a’ 
y = Ey (= = 7 cos 6 (2-11-20) 
We can calculate the surface charge everywhere: 
Sea 
~ 4x \ér f 
ea Ee 
pe a ele —— 2-11-21 
ri é + 1) 00s 57, 008 8 ( ) 
The charge per unit length is 
2n 
pee | potent 0 (2412) 
2n 0 ~ 


This completes our solution. 
We now return to Laplace’s equation (2-11-1) and remove the restric- 
tion of z independence. We search for solutions of the form 


o(r,6,z) = R(r)O(@)Z(z) (2-11-23) 


Substituting, dividing through by g, and multiplying by r?, we rewrite 
Eq. (2-11-1) as 

r d{[ dR r az ee 

lepeniey — ee eee ie 

R ar (: 4 Z dz @ de? Cane) 
Again we realize that the two sides of Eq. (2-11-24) must be set equal to a 
constant. Referring back to Eqs. (2-11-4) to (2-11-8), we have once more 
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© = A, cos n@ + B, sin nO (2-11-25) 
where n is an integer. Inserting this back into Eq. (2-11-24), we obtain 
ied / aR n? iaeZ pelees 
— —{r— — = — o = 
rR dr\ adr r Thaw bas ( ) 


We next find the possible solutions for Z by requiring that both sides 
of Eq. (2-11-26) be equal to a constant which we set equal to —a?. We 
have then 


oe = o2Z (2-11-27) 
The solutions for nonzero values of « are just 

Z,(z) = C,e7 + De (2-11-28) 
In the event that a = 0, the solution to Eq. (2-11-27) becomes 

Z(z) = Co + Doz (2-11-29) 


Finally we must find the solution for R, ,(r) corresponding to a particular 
choice of n and a. Our differential equation is 


a a ig’ — 
7 ar ( dr ) ate (« = “Ron =0 (2-11-30) 


It is convenient at this point to let u = ar for the case where « + 0. We 
then divide Eq. (2-11-30) by «? and obtain the standard form for Bessel’s 
equation: 


OY a ae ae n? 
ae dh )+(-B)aa=0 


The solutions to this equation are best expressed as a power series in u. 
As usual there are two linearly independent solutions, J,(u) and N,(u). 
The first of these, J,(u), is called a Bessel function of the first kind and is 
defined as 


= u\" & (—1)* Uu 2k . 
J,{u) = ($) Laer) (2-11-31) 


The function N,(u) is often called a Neumann function and is defined as 


2 u Ty re ace En 3 Fe 
ae ae ($) o (5) ot (5) 


k i cal = ue nan L (n ny er 1)! +) 
(> if io +) 7 (5) = k! (3 


(2-11-32) 
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Needless to say the use of a digital computer is strongly recommended 
when dealing with either of these functions. In any case, the general solution 
to the radial equation is 


Rin) = ExndI fOr) + Fan N,(ar) (2-11-33) 


where E, ,, and F,,, are arbitrary constants. 
In the event that « = 0 the two solutions for Ro,,(r) are the same as 
those for the differential equation (2-11-5). 


Rol?) = Font” + Eont" (2-11-34) 
Finally, if 2 is also equal to zero, we recall that 
Ro ol") = Eo,o + Foo lnr (2-11-35) 


Again the values of the various coefficients will be determined by 
boundary conditions. Quite often we will find that only specific and discrete 
values of « will permit us to fit our boundary conditions and the first part 
of our effort will be devoted to finding these values. We will then make use 
of completeness and orthogonality to determine the actual values of the 
coefficients themselves. 

The detailed solution of problems of this sort is somewhat beyond the 
scope of this book. The reader is referred to innumerable treatises on classical 
potential theory or to more advanced texts in the field of electromagnetism. 


2-12 THE SOLUTION TO LAPLACE’S EQUATION 
IN SPHERICAL COORDINATES 


Perhaps the most useful of all coordinate systems in the study of electro- 
statics, particularly at the microscopic level, is that of the spherical co- 
ordinates. We will limit ourselves to physical situations with complete 
rotational symmetry about the z axis (axial symmetry). The extension to 
nonsymmetric situations is straightforward and is left to the reader in 
consultation with more advanced textbooks. 

Before proceeding with Laplace’s equation, it is convenient to make 
a change of variable. We let » = cos @ and have thus 

s Z & s, — = (2-12-1) 
Substituting back into Laplace’s equation (remembering that there is no 
Ww dependence), we have 


1 1 
V0 =a ("S) +2 xe Lo (2-12-2) 


We search for a product solution of the form 


‘See, for example, J. D. Jackson, “Classical Electrodynamics,” p. 75, Wiley, New York, 1962. 
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gy = Fir) P(u) (221253) 


Substituting Eq. (2-12-3) into Eq. (2-12-2), we obtain in the same manner 
as before 


id (ay wed » aP | 


Let us begin by examining the radial equation. In most problems 
that we are likely to encounter ¢ will be finite at all points except perhaps at 
r = 0 and r = oo. The reason for allowing @ to possibly diverge at r = 0 
is that the region about r = 0 may be excluded from the problem. For ex- 
ample, if we are dealing with a conducting sphere of radius a, we are only 
interested in g for r > a. The reason for allowing @ to possibly diverge as 
r — oo is to permit the inclusion of cases like a constant electric field out 
to large distances. Since at all other points ¢ is finite, we can expand F(r) 
in a power series, allowing all possible negative and positive powers of r. 

ioe) 

Er = year” (2-12-5) 

T=) — 00) 
If we now substitute this power series back into the right side of Eq. (2-12-4), 
we find that 


o(r =) = ¥ A,mm+1)r"=kF= VY A,kr™ (2-12-6) 


In order that this equality holds for all r, it must hold for each and every 
coefficient of a given power of r. Hence 


mm + 1) =k (2-12-7) 


We conclude then that k can take on only specific and discrete values, 
namely, those which can be obtained through the product of m and m + 1 
where m is an integer. The values of k which are possible are all positive, 
as can be determined from inspection. 

For each value of k that permits a solution, there are two values of m 
which correspond to that k. We list some of these in order in Eq. (2-12-8). 


k Possible m F(r) 

0 v=) Fy(r) = Ay + Bor" 
2 2 F,(r) = Ayr + By? 
6 2 8 F,(r) = A,r + Bar”? 


= (2-12-8) 
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Of course, for each value of k (or n) that is possible, we have a corresponding 
equation for P(u). The solution to this equation will be called P,(u). Our 
final solution for g will then be 


plr.#) = >, (Ape SB ee) (2-12-9) 
n=0 : 


We now proceed to find P,(u), by means of the differential equation 


a (1 — Pr nd + n(n + 1)P,=0 (2-12-10) 
du qu 


As usual we will expand P,(u) in a power series. This time, since we want 
P,(u) to remain finite for all » between 0 and 1, we take only positive integral 
powers of p. 


Pw) = Y Anat! (2s) 
1=0 
We evaluate the first term on the left side of Eq. (2-12-10). 
d ap = z 
t — p”) a =) 4/0 - De? - Y ALI + De! 
1=0 1=0 


du dy 
(2s12812) 


The first sum on the right side of Eq. (2-12-12) can be rewritten slightly 
if we note that the first two terms are zero. 


] 
Ms 


oD A, 1d — Ip? = Anil — a 
1=0 


I 


il} 
N 


Y Ansa + 20 + Dy! (2-12-13) 
1=0 


We can now enter all this information into Eq. (2-12-10), coming up with a 
relationship among the various coefficients: 
Y [Ano + 20 + 2 - A, + 1) + A,intn + 1)] un! = 0 
1=0 
(2-12-14) 
As we have done many times before, we set each coefficient of a given 
power of y equal to zero. We then have 
An te) at |) 
An, (+ 1) + 2) 


(2-12-15) 


This equation, known as a recursion relation, is extremely powerful! It 
relates any given coefficient to the one two places further down the line. 
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Thus, if the first two are known, then the entire series is known. (This 
technique is a very commonly used one for the solution of differential 
equations in terms of power series. The reader might try the method on the 
harmonic equation d*x/dt? = —k?x.) 

To develop our functions P,(u) we first consider the case where n is 
even. We notice immediately that if / is odd, there is no value of / for which 
i + 1) — n(n + 1) is equal to zero. Hence, if one odd term exists, then all 
odd terms exist. As we take / larger and larger, the ratio A, ,/A,,, approaches 
unity. We are clearly in for trouble then at » = 1 because our series diverges 
at that point. There is one and only one solution to our dilemma, to set 
A,,, equal to zero, in which case all odd terms disappear. As far as even / 
is concerned, there is no problem. When we come to the term where / is 
equal to n, the ratio A,,, /A,,, becomes equal to zero. Thus A,,42 = 0 
and the series terminates, all further terms vanishing. 

We conclude then that if m is even, then P,(u) contains only even 
powers of p. Only terms from p° to yu” are present. 

.On the other hand, if m is odd, we can apply the identical argument to 
exclude even powers of y. Again the highest power of , in the series is p”. 
The series runs from yp’ to py”. 

It is useful to find the first few solutions for P,(u). Since any solution 
can be multiplied by an arbitrary constant and still satisfy the same differen- 
tial equation, we must choose some normalization. The usual convention 
requires that ; 


(ori a (2-12-16) 


Obviously, for this normalization, 
i 1) (2-12-17) 


The functions P,(u) are called Legendre polynomials. Since P)() is a con- 
stant, our normalization requires that 


P,(u) = 1 (2-12-18) 


Next we note that P,() is proportional to . By our convention then, 
we have 


Pw) =» (2-12-19) 


To obtain P,() we must first make use of Eq. (2-12-15) to obtain 
A2,2/A2,0- 


Age? ong (2-12-20) 
Azo 2 
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Choosing A, 9 according to our normalization, we have 


3u? — 1 
P,(u) = i (2-12-21) 
Going on to P;(y), we first find that 
ee (2-12-22) 
A314 3 

Fixing A; , by convention, we have 
3 
—3 

P;(u) = ar (2-12-23) 


We could proceed in this manner ad infinitum to find all the functions P,(). 
It is more convenient, however, to make use of Rodrigues’ formula 

Lopate 
2"n! du" 
We then note that the most general solution to Laplace’s equation in the 
case of axial symmetry can be written as 


P,(u) = aay (2-12-24) 


«o 
o(r,8) = > (4, + rir) Palco 6) (2-12-25) 
n=0 

Again our basic problem will be the determination of the coefficients. 
This can be done through matching boundary conditions at a complete 
set of boundaries to a charge-free region. (These conditions can consist of 
the specification of either the potential at the boundary or of the normal 
component of electric field at the boundary.) Alternatively, we may be given 
a charge distribution and asked to find the potential outside of the dis- 
tribution. In the next two sections we will discuss some specific methods 
of handling these problems. 

Before we conclude this section we should note that one of the remark- 
able facts about the functions P,(u) is that they form a complete and 
orthogonal set over the range —1 < pu S 1. That is to say, any function of 
pcan be expressed as a series in P,(u) and no one of the P,(u) can be expressed 
entirely in terms of the others. 

We note first that 


| ONAOEME Be (2-12-26) 


We also note that 


1 i 
| ; [PW]? du = = (2512227) 
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The fact that the Legendre polynomials form a complete set permits 
us to write, for any function f(y) that is finite in the range —1 S p S 1, 


fi) = ¥ A,PAW) (2-12-28) 


We now multiply both sides of Eq. (2-12-28) by P,,(u) and integrate from 
B= —-ltop= 41: 


[- onr,t du=> I An P,(H) y(t) du 


7 ona 1" 
Hence 
2m+1 [} 
A, = D | FWP CH) du (2-12-29) 
-1 


Obviously, were we to choose f(y) to be P,(u), then A,, would be zero unless 
n = m. Our P,(u) thus form an orthogonal set in the sense that no one can 
be expressed in terms of any of the others. 


2-13 SOLVING BOUNDARY-VALUE PROBLEMS IN 
SPHERICAL COORDINATES WITH AZIMUTHAL SYMMETRY 


We now develop some of the techniques for evaluating the coefficients in 
Eq. (2-12-25), the general solution to the azimuthally symmetrical Laplace 
equation. In particular, we will assume here that the potential or its deriva- 
tive normal to the surface is specified at each and every boundary of our 
charge-free region, and we will then force the potential to fit. We will find 
that the set of coefficients will then be completely determined. 

Our discussion is best done in terms of a set of specific problems, 
each of which has some widespread applicability. The extension to other 
classes of problems will be straightforward for the reader once he has 
grasped the general principles. 

We begin by specifying the potential o(a,6) over a spherical surface 
of radius a (see Fig. 2-20). We will assume that no other charges are present 
anywhere and that the potential at infinity is zero. We are interested in 
knowing the potential everywhere, both inside and outside the sphere. 

Since the potential at infinity is zero, the outside potential can be 
written as 


foe) out 


B, 
Poult, 9) as » pet P,(cos D) (2-13-1) 
n=0 
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Fig. 2-20 The potential (a,0) (or its 
derivative with respect to r) is specified at 
the spherical boundary r = a. It is as- 
sumed that no other charge is present and 
that @(oo) = 0. We would like to know 
the potential g(r,@) everywhere, both out- 
side and within the sphere. 


This potential must be equal to y(a,) at r = a, leading to the observation 
that 
fo 0) B out 
o(a,0)= > ae, P,(cos 0) (2-13-2) 
n=0 
Multiplying both sides by P,,(cos 6) and integrating with respect to cos 0 
from cos 9 = —1 to cos 6 = +1 [see Eq. (2-12-29)], we have 


(2m + l)a™*} [. 


out __ 
Bo" = 5 


e(a,9)P,,(cos 8) dcos 6 (2-13-3) 
1 


We have thus found the potential g(r, 0) outside the sphere. Inside the sphere 
we must have B, = 0 or else our solution will diverge at r = 0. We have 
then 


Pin(t,8) = a 7 r" P,(cos 8) (2-13-4) 
=0 


n 


Again we are told what the potential g(r,0@) is at r = a. Substitution of 
r = a into Eq. (2-13-4) yields 


(a0) =. > A, i" a" P,(cos 8) (2-13-5) 


n=0 


The coefficients are thus given by 


2m eee 
Ae _ | (a,6) P,(cos 6) dcos 6 (2-13-6) 
=I 
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Note incidentally that the continuity of @ across our surface at r= a 
implies 
B out 
m 


jue oo (137) 


Now that we have such a powerful result, let us try it on the simplest 
physical situation and see if it works. We will take g(a,0) to be a constant, 
namely, @o. In that case we know from experience that the potential inside 
should be constant (this is like a conducting spherical shell) and the potential 
outside should be 


a 
P(r) = Po - (2-13-8) 
Let us see if we come up with the same answers with our general method. 


We have 


1 


1 
| e(a,9)P,,(cos 0) dcos 0 = @o | P,,(cos 0) dcos @ 
1 sf 


a 
=p | P,(cos 0) P,,(cos 8) d cos 0 
1 


(2@o ifm =0 


(2-13-9) 
0 ifm #0 
Thus 
Ay” = Qo 
Ape OL ree 0 
(2-13-10) 
By" = a9 
oe () form #+ 0 
We finally obtain, as expected, 
4Po 
oe) 
Poul) =~, (282i) 


Pint, 9) = Po 

As we said earlier, we could specify the normal derivative of the 
potential at the surface (r = a) and use that to obtain the coefficients 
B,°" or A,®. Since in general the derivative of @ with respect to r is not 
continuous at the surface (there being some charge at the surface), we 
must use (69,,,/Or),-, to determine the B,"' coefficients or (0¢;,/0r),=, to 
determine the A,,'" coefficients. Once we have determined either of these 
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sets, the other can be determined by requiring that g be continuous. That 
is to say, 9i,(4,0) = Poy,(@, 8). 
For example, we can write 


foe} Bou ' 
(=) = — Se: econ (2-13-12) 
or r=a n=0 a 


This would yield the equation for B,,“': 


= m+2 1 
pe, | (=) P,,(cos 8) dcos 6 (2-13-13) 
1 r=a 


2(m + 1) = Or 
The continuity condition expressed in Eq. (2-13-7) would then tells us A,,'": 
; B out 
Pie ie aint (2-13-14) 


Alternatively we might have specified (¢9;,,/0r),-,, obtained the coefficients 
A, i", and then applied Eq. (2-13-14) to get B,,“. 

Let us now attempt a slightly more difficult problem. Let the potentials 
be specified on each of two concentric spheres at radii a and 3, respectively. 
We would like to find the potential between the spheres. We begin by 
writing down the potentials at a and b: 


o(a,0) = ¥. (4ye = sats ) (co 6) (2-13215) 
(b,6) = ¥. (4,0 He sett) Pleo 6) (2-13-16) 


From Eq. (2-13-15) we determine the linear combination A4,,a" + B,,/a™*} 
for arbitrary m: 


B 2 VB 
Avg oo = Mit | ¢(a,0)P,,(cos 8) dcos 6 (2-13-17) 
1 


q’tt 5) 


We obtain another equation for A,, and B,, from Eq. (2-13-16): 


Be 2m+1 [} 
Ane” a pmtt = “| 
Using both Eqs. (2-13-17) and (2-13-18), we can find the A,, and the B,, 
coefficients, completing the solution to our problem. 

We next consider the problem of a spherical grounded conducting 
ball of radius a that is placed in a “constant” field. (Obviously, after the ball 
is introduced the field is no longer constant near the ball.) We assume that 
the potential is given by 


o(r,6) = —Egrcos@ (2-13-19) 


¢(b,6)P,,(cos 8) dcos 6 (2-13-18) 
1 
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at large distances from the ball. Atr = a, the potential is taken to be zero. 
Now referring back to our general solution, Eq. (2-12-25), we can im- 
mediately set all A, except for A, equal to zero. A, is of course equal to 
— Ep. 


A, = —Epo 


(2-13-20) 
A, = 0 for) 1 
At r = a, we have 
foe) B, 
p(a,0) =0 = —E,acos8+ > Fas P,(cos 0) (2-13-21) 
n=0 


In order that this hold for all 0, we must set the coefficient of each and every 
Legendre polynomial equal to zero. Hence 


B, = Ega? 
(2-13-22) 
B, = 0 eeiyees | 
We finally have then | 
E 3 
g(r,0) = Gz + ~ )eos 0 (2-13-23) 


The (normal) electric field at the surface is given by —(0@/0r),—,. 


Eo -($) = 3E, cos 6 (2-13-24) 
The surface charge density is 
E,  3Ecosi0 
Sa (2-13-25) 


The total charge is zero, as might be expected. 

Finally, we consider what would happen if the conducting sphere we 
have just considered were to be replaced by a dielectric sphere of uniform 
dielectric constant «. The absence of free charge ensures that V: D = 0 
within the sphere. But V: D = eV- E = —cV’@ in the case where ¢ is a 
constant. Hence @¢ satisfies Laplace’s equation within the dielectric as well 
as outside of it. Only at the surface of the dielectric is V*@ not equal to zero. 

We will allow g'(r,0) to be the solution within the dielectric and 
g''(r,@) to be the solution outside. Our boundary conditions are as follows: 


1. og! = —Eprcos @asr > o. 
2. The potential is continuous at the boundary of the dielectric. That is, 


p'(a,6) = e"(a,6). 
3. The potential is finite at r = 0. 
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4. The normal component of the displacement field D is continuous at the 
boundary. Thus 


(20) _ (20! 
or Pe oa or r=a 


The continuity of the tangential component of E at the boundary is of 
course assured by boundary condition 2. 
Expressing both g'(r,6@) and o"(r,6) in conventional form, we have 


g\(r,0) = y (4 i ae By =) Alcos 6) (2-13-26) 
n=0 

TA 5 ( ae BE a 6) (2-13-27) 
ne 


Applying boundary condition 1 tells us that 
A," = —E, 


2-13-28 

Aesth forn + I ( ) 
Applying boundary condition 3 forces us to write 

B,'=0  foralin (2-13-28) 


Using the remaining coefficients, we next apply condition 2. 


(oe) co B 
> A,'a"P,(cos 0) = —Eyacosd+ > ae P,(cos 8) (2-13-29) 
n=0 n=0 


We deduce from this that 


Il 


B 
A,'a = —E,a + =a (2-13-30) 


B, 
A,a =—~ forn# 1 (2-13-31) 


Finally, we apply condition 4. 
é ) A,'na""'P,(cos 0) = —E, cos 0 


+ 1B" 
-> i ot ee " P.(cos 6) (2-13-32) 
We deduce, as before, that 


2B II 
rae —E, = 3 (2-13-33) 
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(n + 1)B,2 


qit2 


i) 


cA, na - forn + | (2-13-34) 
Equation (2-13-31) implies that A,' and B," have the same sign, for n + 1. 

On the other hand, Eq. (2-13-34) implies that they have opposite signs for 
n + 1. We conclude that 


A,| = B,! = 0 forn + 1 (2-13-35) 
Finally, combining Eqs. (2-13-30) and (2-13-33), we have 
3E, 
Aji= —-—% 
; e+2 
(2-13-36) 
eee 
e+2 


We can now write down the solution for g! and g" everywhere: 


3E or cos 8 
I ee) L139. 
e(r,0) = ce (2-13-37) 
INE va? 
I al 0 2132 
y'(r,0) = —Eprcos 6 + (= " 3) = 72 COS 0 (2-13-38) 


A glance at Eq. (2-13-37) tells us that the field within the dielectric is a 
constant in the z direction and is equal to 


, 3 a 
E(inside) = ———~ Epk 3-13-39 
(inside) a ( )) 
In the event that « = 1 (no dielectric) this reduces as expected to Eok. 
The field outside the dielectric is clearly composed of the original constant 
field E,k and a field which has a characteristic dipole distribution with 
dipole moment of 


— 1 

E,a*k hi 
[ay a a (2-13-40) 
We might check to see if the dipole moment we obtain this way agrees 
with what we expect from integrating the dipole moment per unit volume 
P over the sphere. Inside the dielectric we have 


-29( 5) Fok (2-13-41) 
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Inasmuch as the dipole moment per unit volume is constant, we can obtain 
the total dipole moment through multiplying P by the volume of the sphere. 


p = P(3na°) 


eo 

 e+2 - 

This completes our study of spherical boundary conditions. We will 

next learn how to determine the all-important coefficients A, and B, in the 
event that the charge distribution is completely known. 


E,a*k _as before 


2-14 THE MULTIPOLE EXPANSION OF AN AZIMUTHALLY 
SYMMETRICAL CHARGE DISTRIBUTION 


Suppose a distribution of charge in space were limited in physical extent 
and azimuthally symmetrical and we wanted to find the potential it produced 
in some charge-free region. Presumably the potential could be written in 
the form given by Eq. (2-12-25) and our only job would be to find the co- 
efficients A, and B,. We will now develop a beautiful and general method 
for determining these coefficients and incidentally gain a great deal of 
insight into the types of fields produced by various charge configurations. 

We need to distinguish two cases, that where the charge is all at a 
smaller radius than the point at which we wish to determine g and that 
where the charge is all at a larger radius than that point. Obviously the 
solution to any problem can be broken into a sum of these two types of 
solutions, and hence we lose no generality by treating only these situations. 

We consider the first of these cases. Let r’ and 6’ be the coordinates 
referring to the charge distribution (see Fig. 2-21) and let r and @ be the 
coordinates at which we wish to evaluate the potential g. (We assume that 
r’ Sr at all times.) In order that g > 0 as r > oc we must require that 
A, = 0 for all n. Hence 


[o.0) 


B 
g(r,0) = > a P,(cos @) (2-14-1) 
Our job is then to determine the coefficients B, in terms of properties of 
the charge distribution. We recall our old expression for g. 


ey. 
p(r,6,y) = | ia ) dV’ (2-14-2) 
state aon | 


The trick is, of course, to express 1/|r — r’| in a power series and then 
evaluate the set of individual terms. We remember that as long as x < 1, 
we can write 


1 pa ag HO 2 1 


— (2-14-3) 
lyase ag 2 2(4) 2(4)(6) 
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i 

| Fig. 2-21 We consider the potential at a 

1 point (r,6) produced by an azimuthally 

| symmetrical charge distribution p(r’,6’), 
where r’ <r. 


Now 
1 1 
r—r| r? +r’? — Oper’ 
lle Sa SE 
resil Ae = a zai (2-14-4) 
r r 


Expanding by means of Eq. (2-14-3) and grouping terms with the same 
power of r’/r together, we have 


Deg te eg eet = Fe 
lr—r| or r? oF 
5 me) oe or’ 22) 
+e | (2-14-5) 


Our next step is to put Eq. (2-14-5) into a somewhat more pliable 
form. We first note that 


r = (rsin 8 cos w)i + (rsin @ sin W)j + (rcos 6)k (2-14-6) 
r’ = (r’ sin 6 cos W’)i + (r’ sin 6 sin w’)j + (r’ cos 6’)k (2-14-7) 


Hence, in complete generality, 1/\r - r’| is a function of r, 7’, 0, 0’, w, and 
y’. The fact that we are dealing with an azimuthally symmetrical charge 
distribution, however, permits us to evaluate the integral in Eq. (2-14-2) 
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at w = 0 and apply the result at any value of y. Therefore, we need only 
consider Eq. (2-14-5) at y = 0. Inserting this into the above we find 


rer’ = rr(sin @ sin 6’ cos w’ + cos 6 cos 6”) (2-14-8) 
(r-r’)? = r?r’2(sin? 6 sin? & cos? W’ + cos* 6 cos? 6’ 
+ 2 sin 6 sin 0’ cos 0 cos @’ cos wy’) + (2-14-9) 
(r-r’)? = rr’(sin? 6 sin? 6’ cos? y’ + cos? 6 cos? 0’ 
+ 3 sin? 0 sin? 6’ cos 6 cos 0 cos? W’ 
+ 3sin 6 sin & cos? 6 cos? 6’ cos w’) (2-14-10) 


and so forth. 

Before proceeding, we again note that p has no & dependence. Hence 
we can carry out the integration of Eq. (2-14-2) over w’ first. This is equiva- 
lent to replacing the various powers of cos w’ above by their average values 
before integration. The average values are 


<cosw’> = 0 * <cos?W’>=4 <cos*’> =0 (2-14-11) 


This leads to the following results when we average the various powers 
of r-r’ over p’: 


<r-r’>,, = rr’ cos 8 cos 0’ 
<(r-1’)?>,, = r’r’?(cos? 6 cos? 6’ + 4 sin? 6 sin? 6’) (2-14-12) 


{r+ 9’)*>ay = r3(cos? 6 cos? 0’ + 3 sin? 6 sin? 6’ cos 0 cos 6’) — 


Finally, to average Eq. (2-14-5) over w’, we need to combine terms 
and observe that 


3(r-r’)? — r?r’? xe 
a r’* P,(cos @)P,(cos 6’) (2-14-13) 


S(r-r’)? — 3(r-r’)r’2 7? 
eee = r3r3P3(cos 0)P;(cos 6’) (2-14-14) 


Needless to say we can conjecture what the mth term in this series 
will be, and hence we write down immediately for Eq. (2-14-5) averaged 
over w’: 


1 ] r’ é 
(; = “) a [ + = P, (cos 6)P, (cos 6’) 
over y’ 


rN 2 
‘a (=) P,(cos 6)P,(cos 6’) 
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oe 
+ (5)  P3(cos 9)P;(cos 0’) + -- | 


2 = : ( “\ P,(cos 6)P,(cos 6’) (2-14-15) 


If we now finally insert this result into the integral for g(r,6) [Eq. (2-14-2)], 
we find 


© P (cos 6 
o(r,0) =>. —— o(r’,0’)(r)"P,(cos 6’) dV’ (2-14-6) 
n=0 charge 
distribution 


We conclude that the coefficients B, of Eq. (2-14-1) are given by 


Bo | p(r’,0’)(r')"P, (cos 6’) dV’ (2-14-17) 
charge 
distribution 


The terms B, are called the 2”-pole moments of the distribution. For example, 


Bo = p(r’,0’) dv’ 
charge 
distribution 
= total charge (or monopole moment) (2-14-18) 
B, = | p(r’,0’)z’ dV’ = dipole moment (2-14-19) 
charge 
distribution 


B, = | p(r’,0)(r’)? P,(cos 6’) dV’ 
charge 


distribution 
= quadrupole moment (2-14-20) 


Let us return to the beginning again and ask what ¢(r,@) would be 
like if the charge distribution were all at radius larger than r. (That is to 
say, r’ > r for all possible r’.) The expansion for g(r,@) that we must use 
now, in order that g not diverge at r = 0, is 


g(r,0) = >. A,r"P, (cos 4) (2-14-21) 


Equation (2-14-2) would still be appropriate for finding g, but the expansion 
of 1/|r — r’| must now be in a power series in r/r’. Obviously everything 
we have done so far in expanding 1/|r — r’| would be valid if we just inter- 
changed r and r’. Returning to Eq. (2-14-15) we now rewrite it as 


(i : 7) ma 5 y (y P,(cos 8)P,(cos 6”) (2-14-22) 
= n=0 


av 
over 
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We conclude then that 
ie | ee P,(cos 6’) dV’ (2-14-23) 
charge : 


distribution 


Amusingly enough, we can now find the potential even within the 
azimuthally symmetrical charge distribution itself, even though that is 
obviously not a charge-free region. We take the point (r,6) at which we wish 
to find @ and break the charge distribution into two parts, that with r’ <r 
and that with r’ = r. We add the two contributions together and lo and 
behold—we have ¢. In general then, in a case of azimuthal symmetry, 


o(r,6) = y fal pl’, O')P,(cos 6") 1, 


= (ry 1 
trot | p(r’,0’)(r’)"P,,(cos 6”) av’ P,(cos 6) (2-14-24) 


Let us try out this magnificent formula on a uniformly charged sphere of 
radius R (see page 33 and Fig. 2-2). The charge density p is a constant in 
this case and can be taken out of the integral. Equation (2-14-24) becomes 


1 
[remembering that | P,(cos 6’) dcos 6’ = 0 unless n = 0] 
al 


: 4np 
g(r,0) = sno | r dr + — ore |’ (r’)? dr’ 
R2 r2 
= dnp (= be _ (2-14-25) 


A simple calculation using the fields we previously determined from Gauss’ 
law will verify that Eq. (2-14-25) is right. 

The really interesting applications of the development we have just 
carried out are in the field of atomic physics. As we shall see in the next 
section, careful calculation of the energy levels in atomic systems will 
permit us to determine important constants relating to the nuclear shape. 
In any case we are now prepared to deal in a systematic manner with any 
azimuthally symmetrical charge distribution. 


2-15 THE INTERACTION ENERGY OF TWO NONOVERLAPPING 
AZIMUTHALLY SYMMETRIC CHARGE DISTRIBUTIONS; 
DETERMINATION OF NUCLEAR SHAPE 


We will make use of what we have just learned to calculate the interaction 
energy of two nonoverlapping azimuthally symmetrical charge distributions. 
As we will see, this will have an immediate consequence in showing us how 
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to determine the various multipole moments of the nucleus and hence will 
provide exceedingly valuable information about nuclear structure. 

We must first explain precisely what we mean by interaction energy. 
If we go back to Eq. (2-7-7) we recall that the total energy of any distribution 
of charge can be written as 


1 
Vroat = > | ; p(rip(r) dV (2-15-1) 


space 


Suppose that our charge distribution p can be broken into two distinct 
distributions p, and p,. As an example of this we can take the atomic system 
where the nuclear charge distribution might be taken as p, and the electron 
charge distribution might be taken as p,. In any case, we have 


pr) = p(t) + p2(r) (2-15-2) 


Now each of these charge distributions is responsible for a portion 
of the potential at any given position. We can write 


e(r) = oi) + o2(r) (2-15-3) 
where 
p,@) 
ne 7 AV’ 2-15-4 
¢;(r) ae ( ) 
and 
Q2(r) = | ae (2-15-5) 
all ane 


space 


Thus we see that the total energy as evaluated through the integral 
Eq. (2-15-1) can be broken into four parts. 


1 1 
Ororai = s|, pPi(r)g, (tr) dV + an p2(t)p2(r) dV 


space space 
l ] 
wi si Piltpa(r) dV + +| P2(t)p,(t) dV (2-15-6) 
all all 
space space 


The first two terms in the total energy are just the self-energies of the 
two individual charge distributions. The third and fourth terms constitute 
the interaction energy. Now we can easily see that the third and fourth 


96 PRINCIPLES OF ELECTROSTATICS 


terms are equal: 


\ puleoeere {) pr@)p2(t’) dV dV" 
1 2 = a 


|r — r'| 


space 
ore 


p2(t')o,(r’) dV’ (0-15-72) 
all 
Hence we conclude that _—sPace 


Vin = | palpi (r) dV (2-15-8) 
all 


space 


Now, if the two distributions are azimuthally symmetrical about a common 
axis and arranged in such a way that p, is always outside p, (as in an atom), 
then our expression for the interaction energy can be written in a very simple 
way. We have for 9, 


oe) (1) 


B 

gi(r,0) = ) “rer Px(cos 8) (2-15-9) 
n=0 

where B,") is the 2”-pole moment of the inner distribution p,. Substituting 

into Eq. (2-15-8), we obtain 


Cn = Bae (2-15-10) 


n=0 


where, as usual, 


A, = | Pal Oreos 9) (2-15-11) 
all 


space 


In the event that our charge distribution is an atomic system the part 
corresponding to the electrons is generally well known. Hence the 4,'?? 
coefficients in the above expansion can be found. In addition to this there 
are basic limitations set on the possible values of as the result of fundamen- 
tal quantum-mechanical theorems. For example, the requirement that the 
laws of physics not change when we reverse the direction of flow of time 
in our equations implies that n cannot be odd. Furthermore, the maximum 
value of n is related to the total angular momentum of the system. Only 
total angular momentum values of [,//(/ + Lh]/2x are allowed for a 
physical system, where / is an integer or half-integer and h is Planck’s 
constant. The maximum value of n that we can have represented in a multi- 
pole expansion of the charge distribution may be shown on the basis of 
these fundamental theorems to be equal to 2/. Hence by knowing the spin 
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of the nucleus we can determine how many moments can possibly exist. 
Since the charge is known, the first unknown moment in Eq. (2-15-10) is 
the quadrupole moment. A measurement of this term will now tell us 
whether the nucleus is “pancake” shaped or “‘cigar’’ shaped. A pancake- 
shaped nucleus will have negative quadrupole moment and a cigar-shaped 
nucleus will have positive quadrupole moment. Of course, a sphere has no 
quadrupole moment. 


2-16 THE ELECTROSTATIC STRESS TENSOR 


In all our discussions of the methods for determining electric field and the 
energies associated with these fields we have paid very little attention to the 
forces exerted on charges and charge distributions. We will remedy this 
shortcoming now by showing in detail how the force on a distribution of 
charge can be calculated in either of two ways. We can carry out a volume 
integral over the charge distribution itself in which we multiply the charge 
within any infinitesimal volume dV by the field at that point. Alternatively, 
we will discover a method of converting our volume integral into a surface 
integral in which only the fields on a surface enclosing the distribution of 
interest need be known. 

We begin, of course, with the force on the infinitesimal bit of charge 
in the volume dV. If p is the charge density in the volume and E is the 
electric field there, then 


dF = pEdV (2-16-1) 


Integrating, we find for the total force on a given volume V of charge 
F = | pEdV (2-16-2) 
Vv 


Let us apply this to a very simple problem. Consider a uniform sphere of 
charge of radius R and total charge Q (see Fig. 2-22). We would like to find 
the total force exerted by any one hemisphere of the sphere on the opposite 
hemisphere. For convenience we have set up our z axis as shown and evaluate 


— Fig. 2-22 A charge Q is spread uniformly over the volume 


of a sphere of radius R. We would like to find the force which 
the lower hemisphere exerts on the upper hemisphere. 
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the force on the upper hemisphere by the lower hemisphere. Obviously 
only the z component of this force will remain after integration: 


E= pE, dV ~ (2-16-3) 
hemisphere 


We recall from Eq. (2-2-6) that E(r < R) = Qr/R*. Hence 
1 £ cos 8 (2-16-4) 


where @ is the usual polar angle. The charge density p is just equal to 
Q/$nxR°*. Substituting into Eq. (2-16-3), we obtain 


| 8 sin 6 dé i, ge! 
r= cos @ sin r° dr 
i, 


ay30* 
~ 16 R? 


(2-16-5) 


So far we have added nothing new to our knowledge; we could have 
carried out this calculation 50 pages ago. We will now show that Eq. (2-16-2) 
can be converted into a surface integral where only a knowledge of the 
electric field on the surface is necessary. We recall that p = (1/42) V~ E. 
Substituting into Eq. (2-16-2), we have 


1 
F= ae \, E(V - E) dV (2-16-6) 


We next subdivide Eq. (2-16-6) into its components and make use of a simple 
vector identity and Gauss’ theorem: 


1 : " " 
F -=|| £0 -Bi dv + | 59 -B dv + | £0 - Bk av | 
An V Vv 4 


=| [V (EE) ~ E-Vejiay + | [V-E)-E 
Ebay + | [V - (E,E) - E-vejkav} 
Vv 


Edy 
= Gq |, EE da - x |, (E - V)EdV (2-16-7) 


The surface S, as usual, encloses the volume V. To evaluate the second 
integral on the right-hand side of Eq. (2-16-7), we use the vector identity 


V(E-E) = 2(E- V)E + 2E x (V x E) 
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Since V x E = 0, we can rewrite Eq. (2-16-6) into the form 
1 1 
F = —_| E(E-f) dA — — E? -16- 
a { (E - fi) e- |, V(E*) dv (2-16-8) 


We can now convert the second term into a surface integral by means of 
one of our standard identities. This leads to the result 


1 E? 
as ee {[[ee-m = > dA (2-16-9) 


A good look at the integrand of Eq. (2-16-9) will convince us that it can 
be written as the product of a tensor T and the vector ft. We write 


+ lee ee = Th 2-16-10 
4n yD = ae 
where 
12 
E — = EE. EE. 
1 ; EA 
T= ae Eee, 1 <a a (2-16-11) 
2 
BE. Eye: E,? — = 
We conclude then that 
KF = | Th dA (2-16-12) 
s 


The tensor T is called the electrostatic stress tensor. As we see from Eq. 
(2-16-12), we can obtain the force on a given volume by integrating the 
product of the stress tensor and a unit normal over the surface bounding the 
volume. We will illustrate this technique by referring back to the problem 
we have just solved and again try to find the force on the upper hemisphere 
in Fig. 2-22. 

The surface we choose is arbitrary as long as it encloses the entire 
upper hemisphere and no other charge. There are two natural choices for S. 
One is the complete xy plane and the other is the actual bounding surface 
of the hemisphere itself. We will only use one of these, the complete xy 
plane, and leave the other to the reader as an exercise. 

On the xy plane, E, = 0 and f = —k. Hence, on this plane, 


_— 
= —— E*k 
TA = 
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Using dA = 2ar dr in Eq. (2-16-9), we obtain 


Fat , E?r dr 
1 R O77 i oo) oO 
Ba = See 
re 78 dr + ae 3 
2 
ws a £ (2-16-13) 


Needless to say, this result is identical to the one we obtained earlier by 
more conventional means. 

The utility of this stress-tensor technique will be most apparent in a 
situation where we know the electric fields surrounding a charged object 
but do not know the distribution of charge itself. By evaluating the stress 
tensor on a surrounding surface, we will indeed never have to determine this 
charge distribution if we want to know the force acting on it and nothing else. 


PROBLEMS 


2-1. A charge g is brought to a distance d from the center of a grounded (gy = 0), 
conducting sphere of radius R (see figure). 

(a) Show that the potential at all points outside the sphere can be obtained by 
replacing it with an “equivalent” point charge at some place on the line 
between its center and g. (The equivalent charge need not have a magnitude 
of —q.) 


(b) What is the charge per unit area of the sphere as a function of the polar 
angle 6? 

(c) What is the total charge of the sphere? 

(d) How much force is exerted by the charge g on the sphere? 

(e) Suppose the sphere in the above problem were uncharged, rather than at 
zero potential. What would the answers to parts (b) and (c) become? 


2-2. A slab of dielectric of thickness ¢, length ZL, and width W is inserted between 
two plates of the same length and width with separation d. The plates are con- 
nected to a battery, with a potential difference V. Find the force on the dielectric 
when it has been inserted a distance y. 
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2-4. 


2-5. 


a yp — ee 


5] 


= 


Top view 


Find the energy stored in a uniform, spherical charge distribution of radius R 
and total charge Q. 


A charge Q is deposited on a spherical conductor of radius R. What is the energy 
of the distribution? 


Two long, concentric conducting cylinders have radii a and b, respectively, 
and are each of length /. The space between them is filled with material having 


\ 


/ 
t 
\ 
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dielectric constant «. If the potential difference between the cylinders is V, 
find the total energy stored in the fields between them. 


2-6. Consider the plane interface between a region with dielectric constant ¢ = 1.3 
and a region with dielectric constant 1.6. The electric field in the first region is 
at 45° to the surface. What is the direction of the electric field in the second 
region? 


13 | € = 1.6 


2-7. A spherical conductor of radius a is surrounded (as shown) by a concentric 
dielectric with dielectric constant ¢, inner radius b, and outer radius c. These 
are in turn surrounded by a conducting shell of radius d. If a charge +Q is 
placed on the inner conductor and — Q on the outer conductor, find the electric 
field E and the potential ¢ at all points between the conductors. 


2-8. A dielectric separates two conducting plates. The area of the plates is 4 and the 
separation between them is t. The dielectric constant varies linearly as 


& = & + kx forOSxSt 


A charge + is placed on one plate and —Q on the other. 
(a) Find D between the plates. 

(6) Find P everywhere between the plates. 

(c) Find E everywhere. 

(d) What is the potential difference between the plates? 
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2-9. 


The space between two concentric conducting spherical shells is half-filled with 
material of dielectric constant ¢, as shown. The radii of the shells are a and 5, 
respectively. A charge Q is placed on the inner sphere and a charge —Q is 
placed on the outer sphere. Find the fields E, D, and P at all points between 
the conductors. 


2-10. Find the six independent coefficients of capacitance for three concentric spherical 


2-12. 


conducting shells, having radii a, b, and c, respectively (a < b < c). Choose 
two sets of values for ,, ~, , and show explicitly that Green’s reciprocity 
theorem works. 


. A charge Q is brought from infinity to the neighborhood of an isolated un- 


charged conductor. The fields due to the induced charge distribution on the 
conductor do an amount of work W on @Q as it is brought in. We now “freeze” 
the surface charge in place on the conductor and remove Q back to infinity. 
How much energy is stored in the remaining electric field distribution? 


Consider an atomic system which consists of a proton at the center of a uniform 
negative spherical charge distribution of radius R = 0.5 x 107° cm. The total 
system is uncharged. A field E is now applied to the system causing it to become 
polarized. 

(a) Calculate the induced dipole moment. 

(b) Calculate the amount of work that the applied field has done in moving the 

proton from the center of the charge distribution to its new position. 

(c) Calculate the amount of field energy that would be present in a volume of 
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4nR° having constant field E within it. How does this compare with the 
answer to (b)? 


2-13. A conducting spherical shell of radius R is cut into three segments, as shown, 
extending, respectively, from 0 = 0 to 0 = 60°, from 6 = 60° to 6 = 120°, 


and from @ = 120° to 6 = 180°. They are insulated from one another. The 
uppermost and lowermost segments are grounded (y = 0) and the central 
segment is held at potential @p. 

(a) Find the first four terms in a multipole expansion of the potential forr > R. 
(b) What is the total charge on the sphere? 

(c) What is the quadrupole moment of the charge distribution on the sphere? 
(d) What is the potential at the center of the sphere? 


2-14. Making use of the stress tensor, demonstrate that the force per unit area on the 
surface of a conductor is 2a? where o is the surface charge per unit area. 


2-15. Demonstrate that the capacitance of any conductor is always smaller than or 
equal to the capacitance of a conductor which can completely surround it. 


2-16. A flat circular disk of radius R has a charge Q distributed uniformly over its 
area. Show that the potential @ at point (7,6) where r > R is given by 


4 


r 


2 
o(r,6) = ak _ z(+) P,(cos 8) + (4) P,(cos 8) 


6 
= al) P.(cos 6) + -- ‘| 


2-17. A charge Q is distributed uniformly along a line coincident with the z axis from 
z= —a to z = +a. Show that the potential at a point (r,9) where r 2 a is 
given by 


2 
g(r,6) = aL + 3(4) P, (cos 8) + (4) P,(cos 8) + :: ] 


3 
Electromagnetism and Its 
Relation to Relativity 


3-1 INTRODUCTION; THE MICHELSON-MORLEY EXPERIMENT 


We begin our study of electromagnetism in a highly unorthodox way— 
by deriving much of it from electrostatics, from the special theory of 
relativity, and from the underlying hope that the laws of physics, when 
looked at properly, are elegantly simple in their formulation. This is quite 
antithetical to the usual notion that physics is an empirical science and 
should be presented that way. The beauty of physics lies in the extent to 
which seemingly complex and unrelated phenomena can be explained and 
correlated through a high level of abstraction by a set of laws which are 
usually amazing in their simplicity. In the history of this abstraction, no 
triumph has been more spectacular than electromagnetic theory. 

105 
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To develop electromagnetism in its most beautiful form we must 
begin with the special theory of relativity. (Historically, of course, the 
roles of electromagnetism and relativity were reversed, and relativity was 
derived to explain the fact that Maxwell’s equations were not invariant 
under a galilean transformation.) We will first describe the Michelson- 
Morley experiment, which played a rather crucial role in the path to 
relativity. 

At the turn of the century, one of the more intriguing problems for 
the experimental physicist was a measurement of the velocity of the “ether,” 
that is, the medium which carried light. As we all know, the velocity of 
sound varies depending upon our velocity relative to the air. If v, is the 
velocity of sound in still air and V is our velocity relative to the air, then 
we observe a velocity v, for the sound given by 


y=¥-V (3-1-1) 


Presumably, it was argued, light also has a fixed velocity relative to some 
medium which permeates all space, and if we find the velocity of that 
medium relative to the earth, we will be able to calculate the velocity of 
light in any direction as seen by our earthbound observer. To this end, 
Michelson and Morley designed an ingenious experiment which was to 
revolutionize the entire concept of space and time. 

There were already some indications of trouble before Michelson 
and Morley came along. Historically all the basic laws of electromagnetism 
were already understood by that time and had been formulated by Maxwell 
in terms of his elegant set of equations. As we shall see, these equations 
embodied all electrostatics and magnetostatics as well as the possibility of 
producing electromagnetic radiation by accelerating charges. Now Max- 
well’s equations contain a constant c which plays two vital roles. On the 
one hand, it is the ratio between the electrostatic unit of charge and the 
electromagnetic unit of charge. In other words the magnetic field produced 
by a moving charge will depend on the’ratio of its velocity to c as will the 
force felt by a charge moving in a magnetic field. On the other hand, c is 
also the velocity with which electromagnetic radiation will propagate. If 
indeed there were an ether, then the form of Maxwell’s equations would 
change from reference frame to reference frame. In the reference frame of 
the ether itself the velocity of radiation could be considered a constant 
independent of direction. In any other frame of reference the velocity of 
light would depend on direction. Would that mean that the force between 
current loops would then depend on their orientation relative to the ether 
flow? Nothing in Maxwell’s equations allows for this possibility at all, 
and they would obviously need extensive modification and correction. 
Notwithstanding these problems it was still considered of the utmost 
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importance to make a direct measurement of the velocity of the ether 
relative to the earth. This Michelson and Morley set out to do.! 

Inasmuch as the ether has no problem in passing through matter 
(it could not be removed from an evacuated container) and would hence 
not be dragged along by the earth, one would anticipate a variation in the 
velocity of light on the earth’s surface as the earth traveled about the sun. 
The velocity of the earth relative to the sun is about 10~* times the velocity 
of light in ether, and hence variations of this order were anticipated in the 
relative velocity of earth and ether over the course of a year. 

An idealized sketch of the apparatus is shown in Fig. 3-1. It is assumed 
that the ether is moving to the right along the positive x axis with velocity V. 
A light beam, initially traveling in the positive x direction, strikes a half- 
silvered mirror A inclined at 45° to the x axis. Half the light continues 
through to a mirror B at a distance L, and back again. The other half 
reflects into the positive y direction at mirror A, goes on to mirror D ata 
distance Lp, and then returns to A. The returning beams from both B 
and D recombine to produce beams going in the negative x and y directions. 

[ Now in practice the observer looking into A from either the negative 
x or negative y direction will see an interference pattern, as light which has 
passed through a given path on one arm meets light which has passed 
through a corresponding path on the other arm. Optical paths are made to 
vary somewhat from the “‘ideal’’ to produce a series of “fringes” as the 
observer looks into A. We will continue to assume, though, for the sake of 


all 


x 


“Ether” velocity V 


— — — — — — — —__»— —_ 
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Fig. 3-1 An idealized sketch of the Michelson-Morley apparatus. 


1 The author takes no responsibility for the correctness of the historical exposition, Not having 
studied the history he has imagined what it must have been like and has arranged it to suit. 
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argument, that all beams are perfect, parallel beams traveling either 2L, 
or 2Lp as they go through excursions (A > B > A) or (A > D = A).] 

Let us calculate the time for the paths A —- B-> A and A> D-—A 
in the classical manner, by imagining ourselves moving with the ether. 
We now see the light moving with velocity c, and our calculation requires 
only that we know how far the light must travel. 

We see the light beam heading toward B, but we see B coming to 
meet it with velocity V. Hence 


yo a Vigag 
ee —— 
Similarly 
Lg + Vtg.4 
lb44 = — ~~ 
Hence 
i 
laoa =~ aay (3-1-2) 
Os: 
tpaa = (3-1-3) 
and 
25 
; (3-1-4) 


es = > aL 
BRA ee) 


As we watch the light path 4 + D > 4A, it appears as in Fig. 3.2. The 


distance the light must travel is 2\/Lp? + V?t,.p.42/4, and hence the 


time it takes must then be 


2/ Lp ey ep 
laspD+4 = LL — 7 


D 


a Light path 


Lp 


—+—____. A Fig. 3-2 The path of the light from A to 
Path of A D to A as seen from the “‘ether.” 
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Thus 
0 


If we were now to adjust L, and Lp for maximum constructive interference, 
we would have 


(3-1-5) 


na 
Ve A (Spo, 
c 
where / is the wavelength of the light. Without loss of generality we can 
set n = O if L, and Lp are about equal. We have then 


Ls 


Jia Fie oe) 


We can now turn the apparatus through 90° so that the ether is moving along 
Lp. The interference should change, of course, as we carry out this rotation. 
We have obviously arranged it, according to Eq. (3-1-6), so as to have 
Ly slightly longer than L,. After rotation, however, this relationship will 
no longer do, in general, for maximum constructive interference. 

We can investigate the sensitivity of our apparatus by asking how 
long the arms must be in order that maximum constructive interference 
changes to maximum destructive interference as we rotate. We assume that 
V is equal to the velocity of the earth, namely, 10~“c. In our rotated system 
the times are now given by 


ig 


Lp = 


Se eee Sey 
TA+D+A el — V2/c?) (3-1-7) 
and [making use of Eq. (3-1-6)] 
ra = 2L, 
pie i e/1 — V*/c? 
ied (3-1-8) 


c 


Hence, letting the difference between these times be equal to half the wave- 
length divided by c, we have 


pf | Ss) 4A 
e \1 — V/c? ~ 2e 


This leads to the result 


A 


aan 3-1-9 
4? (3-1-9) 


Lp 
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If we now take V = 10°*c and 4 = 4 x 10°° cm, we find that Lp must 
be 10 meters long. This would be a formidable piece of hardware if it were 
built according to our model above. In practice the length is achieved by 
means of a long series of reflections, but the basic principle is unchanged. 
In any case, the observer watches the interference fringes carefully as 
the apparatus is rotated and looks for a shift in the pattern. 

Can you imagine the excitement that Michelson and Morley felt 
when they observed no change at all in the interference pattern as they 
rotated the apparatus? The immediate explanation that probably occurred 
to them was that at this particular season the earth and the ether must be 
moving together. So they would have to wait 6 months to really be sure! 
It was probably a very difficult 6-month wait, but, needless to say, no change 
in the interference pattern was ever seen. Out of this experiment grew one 
of mankind’s most beautiful ideas: Einstein’s special theory of relativity. 


3-2 THE LORENTZ TRANSFORMATION 


Of course, there were many attempts at finding alternative solutions to the 
predicament posed by the Michelson-Morley experiment, but we will, in 
keeping with Einstein, take the very simplest. We shall assume that the 
velocity of light is a constant of nature and that observers in all frames of 
reference moving with constant velocity with respect to one another measure 
the same constant. This immediately explains the Michelson-Morley experi- 
ment and leads us directly to the Lorentz transformation for converting 
the coordinates (x,y,z,t) of an event in one frame 2 to the coordinates 
(x’,y’,2’,t’) of the same event in an equivalent frame &’. Equivalent frames 
will be taken to mean frames which move at constant velocity relative to 
each other. 

To begin our derivation of the Lorentz transformation, let us assume 
that system &’ is moving with velocity V along the x axis of system & (see 
Fig. 3-3a). Let us further assume that the x’ axis lies superimposed along 
the x axis and that the y’ and z’ axes are parallel, respectively, to the y 
and z axes. Both clocks (¢ and ¢’) will be set at zero when the origins of X 
and 2’ coincide. 

We next show by trivial argument that y’ = y and z’ = z. Let us take 
two absolutely identical twins and stand one along the y axis and the other 
along the y’ axis and have the twins approach each other. Now, for the 
sake of argument, let us assume that twin John in the X system believes his 
brother Jim to be shorter. Then, since we assume neither system to be 
preferred, Jim must think that John is shorter. As they collide we have a 
most remarkable set of circumstances—John sees Jim’s head plow into his 
own belly as he feels his own head plow into Jim’s belly. Quite absurd!! 
There is only one sensible solution, each still sees the other as being of the 
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(a) 


Fig. 3-3 (a) The coordinate system ©’ moves with velocity V 
along the x axis of the X coordinate system. John is riding along 
in the © system while Jim is in the L’ system. 


same height as himself. That is, dimensions transverse to the direction of 
motion are unaltered in a Lorentz transformation. 

Now we come to the matter of clocks. We would like to examine how 
each of our twins sees the other’s clock in relation to his own. Since the 
velocity of light c is a fundamental constant of nature, the “ideal” clock, 
so to speak, is a perfect stick of length L with a mirror at each end and a 
bit of light beam bouncing back and forth. Let each twin align his clock 
along the y (or y’) axis. Each twin sees his own light beam take a time 
t = 2L/c for the complete cycle. Meanwhile he sees the other twin’s light 
take an amount of time £,,,.,. Of course, fo,-, is larger than t, since the 
light must travel along further (see Fig. 3-35). The path of the other light is 


( Visca? 


Pathigns — 2) ee ae 4 


Consequently 


pa path other 
ther ~~ 
other c 


and, solving for fotners 


ab Af 
t 


omer ON -Vye fi —- Ve 


(3-2-1) 
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Fig. 3-3 (cont’d) (b) Path taken by Jim’s light beam as seen by John. 


Thus each sees the other’s clock moving more slowly, by a factor of 


1 
J1 — V2/c? 


Having developed a means of comparing clocks in two equivalent 
coordinate systems, we now attempt a comparison of lengths along the 
direction of relative motion. Let John, in the 2 system, lay his standard 
clock, of length L as he sees it, along the x axis. We presuppose now that 
Jim in the &’ system measures the length of the rod as L’, and we wish to 
relate L’ to L. John (2) believes that the light takes a time ¢ = 2L/c fora 
complete cycle, and, as we have already determined, Jim thinks it takes 
longer, namely, 2L/c./1 — V?/c?. Now let us visualize the entire process 
as seen by Jim (2’) (see Fig. 3-3c). He sees the transit of light to the right 
as taking an amount of time tz. At the same time he sees the end of the rod 
moving toward the light beam with a velocity V, and hence the distance 
covered by the light beam in going from left to right (A to B’)isjust L’ — Vt. 
The light having traveled at a velocity c has thus taken a time tk = 
(L’ — Vtg)/c, yielding 


ip = £ 
Rk el + Vic) 
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(c) 


Fig. 3-3 (cont'd) (c) The X system as viewed by an observer in the 
&’ system. L’ is the apparent length of £’s clock. The path indicated is 
followed by 2’s light beam in the course of one cycle. 


Similarly we would find the time from B’ to A’ as 


i= fi 
ETAL Vie) 
Using our information about clocks, we have 
OL i 25 1 


bot f= = 
Fe ie Ve mee |? 
yielding 
V2 


2.) 1 ib 
a 
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(3-2-2) 


(3-2-3) 


We thus come to the following conclusion. Each observer determines 
distances along the direction of relative motion to be foreshortened in the 
other coordinate system. Jim thinks that John is abnormally thin in one direc- 


tion, and Jim, of course, thinks the same about John. 


Having learned how to compare meter sticks and clocks, we are now 
ready to derive the full Lorentz transformation. Let us say that an event 
occurs at position x and at time ¢ in the & system. The observer in the 2’ 


system reasons as follows: 


1. When both clocks were at zero, then the point x would appear to him as 


fy? 
le i 
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2. During the time interval ¢’ (before the event occurs as seen by ~’), 
the & system moves in the negative x’ direction by an amount Vr’. 
3. Hence we conclude that 


| Vv? 
XS eee 
G 


Solving for x, we have 


ce ee ey (3-2-4) 
/l = V7 se 


By symmetry we can also say 


=i 
¢ + Ja (3-2-5) 


/1 — V7 /c? 
Combining these equations, we have 


pile + (Vi[c?)x’ 


Ji- Ve Saal, 
= 2 
toe t — (V/c*)x (3-2-7) 


J/1 — V?/c? 


To make our notation simple, we define the symbols f and y by 
V 
B= — (3-2-8) 


—_— (3-2-9) 


Ji-# 


Let us now set down the matrix 


y 0 0 iBy 
0 1 
L= where i = \/-1 (3-2-10) 
0 0 
Spy Oey 


We will call the element in the uth row and vth column Ly. If we now 
write 


xa y X= ict (3-2-11) 
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then the Lorentz transformation can simply be written as 
4 


xX, = x | Ss (@22-12) 


We note immediately the formal similarity with equations in three-dimen- 
sional space. For example, the analog of Eq. (1-2-3) is 


4 
oy Pry lioy. = On, (3-2-13) 


In terms of our four-dimensional notation, the old rotation matrix a 
(see page 5) is now written asR: 


41, G2 a3 0 


a a a 0 
R= 21 922 43 (3-2-14) 
43; 432 433; 0 


0° 0° 0a 


A pure rotation in three-dimensional space would then be expressed as 
Xi, = ¥ Ry Xy (3-2-15) 


Now R and L are both subsets of the set of all four-dimensional rotations 
(called the Lorentz group). If we wish to find the Lorentz transformation 
between two coordinate systems whose relative motion is along some 
direction other than the x axis, we can apply the succession of rotation, 
Lorentz transformation, and inverse rotation to achieve our ends. 

The set of four quantities (x,,x.,x3,X,4) is a four-vector in this four- 
dimensional space. As we anticipate, its length remains invariant under 
any Lorentz transformation: 


HP yg? Sete x? + yee? — cH? (3-2-16) 


All sets of four quantities which transform as (x,,xX2,x3,X4) under this 
group of transformations will be called four-vectors. Scalar products 
of four-vectors are made in the same way as before. If u and w are four- 
vectors, then the scalar product is 


4 
uw= ) uw, (3-2-17) 
u=i 
For convenience we will also adopt the convention that Greek subscripts 
(u,v,etc.) refer always to the numbers 1, 2, 3, 4, while Latin subscripts 
(i,j,k,etc.) refer only to the numbers 1, 2, 3. 
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The definition of a tensor of the second rank in four dimensions is 
quite analogous to that in three dimensions. The 16 elements transform 
like the set of component products of two four-vectors. There is nothing 
analogous to the vector product in four dimensions, since, as we have seen, 
the vector product is a special tensor of the second rank in three dimensions. 
We shall have more to say about this shortly. 

In any case, we have now developed enough formal machinery to 
enable us to transform the position and time of an event as seen by one 
observer into the position and time of the same event as seen by an equiva- 
lent observer. We can now take a look at the mechanics of a moving particle 
and see how the dynamical quantities which characterize its behavior can 
be transformed as we go from one to another of these coordinate systems. 

We begin with the simplest dynamical quantity, its velocity, which 
we call u in the © system and w’ in the L’ system. As usual, 2’ moves with 
velocity V along the positive x axis. We have 


dx’ dx — Vat 


/ 


Ms at ~ dt — (Vie) dx 

a (3-2-18) 
ul, = - =e (3-2-19) 
u, = - ae (3-2-20) 


The most interesting of these velocity transformation formulae is 
Eq. (3-2-18) relating u, and u’,. In the old days, before we heard of relativity, 
we would have simply let uv. = u, — V. Thus if a particle was moving along 
the negative x axis with velocity u, = —c and the X’ system was moving 
along the positive x axis with velocity 3c, we would naturally expect the 
x’ observer to see the particle moving with velocity 1.5c. It stands to reason! 
Now we say, instead, that u, = —1.5c/(1 + 3%) = (—24/25)c. Very odd 
indeed. In fact, try as we may, as long as |u,| < cand |V| < c we can never 
have |u| greater than c. 

We next consider the component of momentum of the particle that is 
transverse to the direction of relative motion of the two coordinate systems. 
This component must be invariant under Lorentz transformation from one 
of these systems to the other. Physically, this is easiest to understand by 
placing oneself first in one frame and then accelerating to the other frame. 
As we accelerate we see fictitious forces causing a change in momentum 
in the x (or x’) direction but never in the y or z directions. (We could not 
even know how to define the y or z directions on the basis of the relative 
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velocity of the two coordinate systems above.) Making use of this require- 
ment, and freeing ourself of the notion that mass is an invariant scalar quan- 
tity under a Lorentz transformation, we have 


mu, = m'u, (3-2-21) 
Inserting our previous result from Eq. (3-2-19), we have 
m u 


te J 


mu, yi —u,V/c*) — 


The expression on the right can be simplified as follows. We first calculate 
1 — u’*/c? by inserting the expression 
wre ue tu? +? 
_ (uy — VY? + 4,21 — V7/c?) + u,2(1 — V?/c?) 
(= wey 
Simplifying and combining terms yields 
uv? (1 — w/e?)(1 — V?/e?) 


(Cee 


c? (1 — u,V/c?)? 
or 
u,V Lage 


We will find this expression quite useful. Returning to Eq. (3-2-22), we can 
now write 


ion ates eile 
ee) t= Po lS @ (3-2-24) 


We see thus that the expression m./1 — u?/c? is an invariant constant 
under a Lorentz transformation. We will call it mo, the value it has when 
u = 0. We have then 


Mo 


J/1 — u?/c? 
The quantity mp is referred to as the rest mass of the object. Having found 


the way to transform m as we go from one system to another, we are now 
ready for the x component of momentum. As usual we write 


a (3-2-25) 


if Yee 
Px = Mt, 
eee. 


oT Se 21 — u,V/c2 


(3-2-26) 
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But, using Eq. (3-2-23) again, we obtain 


poe eV) (3-2-27) 


fare 


Putting all of what we have learned together, we finally come up with the 
equations 


ie V Cc 
= --——m 
kame 


Py = Py 
Pz = pz 


eae 
m'c = y\ me — ——Px 


Lo and behold, the set of four entities (p,, p,, p,, imc) transforms as a four- 
vector under a Lorentz transformation! 

We now make a most remarkable discovery—that momentum con- 
servation, if it is to be independent of coordinate system, implies energy 
conservation. Imagine a collision between particles () and @) giving rise 
to particles @) and @. Momentum conservation tells us that 


Pi + Po = P3 + Pa (3-2-29) 


Let us form the four-vector p; + py — p3 — Da: 


(3-2-28) 


Pi + P2 ~ Ps — Pa = [Pi + Pa — Ps — Ba, my + m, — mz — m,)c] 

= [0, im, + m, — m3 — m,)c] (3-2-30) 

If the first three components are to remain zero for any Lorentz transforma- 
tion, then the fourth component must also be zero. Hence 

m, +m,=m;,+m, - (3-2-31) 

To bring this equation into correspondence with classical physics we note 


that for small u 


Moc? 1 
me = 2 imac? 1+ — 
1 — u/c? 2c 


~ mc? + 4mou? (3-2-32) 


| = 
N N 
n—_—” 


This is then equal, for small u, to a constant plus the kinetic energy of the 
mass. We then define the energy of our mass as mc? and observe that energy 
conservation is a direct consequence of momentum conservation. 
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We can derive a very useful relationship between energy and momen- 
tum by making use of Eq. (3-2-25). We let mc? be denoted by the symbol E 
and recall that 


Pd ad 
2 aa eS 
i an ea 
Hence 
my*c* 
FE? = 0 = ee 2,4 >. 
Pane prc* + mo°e (3-2-33) 


Another equally useful expression relates the velocity u to the ratio of 

pe and E. Making use of Eqs. (3-2-25) and (3-2-33), we can easily obtain 
mye PE ips 

p= - E (3-2-34) 

Before we leave the subject of relativistic kinematics, we will say a 
few words about a system of units which plays a predominant role in dealing 
with atomic, nuclear, and subnuclear phenomena. This system is based on 
the electron volt as a unit of energy. The volt is a unit of potential difference 
in the mks system and can be related to our unit of potential difference, 
the statvolt, by the equation 


1 statvolt = 107 8c volts (3-2-35) 


where c is the velocity of light in centimeters per second (107 °c is about 
300). The electron volt (eV) is then the amount of energy which corre- 
sponds to moving a charge equal to the charge of an electron (or proton) 
through a potential difference of 1 volt. Since the charge of the electron, e 
is equal to 4.8 x 10~}° esu, we have 

A Sox lina’ 


—) a = ae = alla a) 
leV= 300 enas'— 16x 10 “ergs (3-2-36) 


Next we introduce the units of momentum and mass which go with 
the electron volt in common usage. Both are obtained from our energy 
units by allowing c to equal | in our kinematic equations. The unit of 
momentum is called the eV/c and the unit of mass is called the eV/c?. The 
rest mass of a particle whose rest energy is moc? eV is thus my eV/c?. (In 
Table 3-1 we list the rest masses of some of the more important elementary 
particles.) The momentum (in eV/c) of a particle whose mass and energy 
are known is obtained from Eq. (3-2-33): 


— 2 2 
PineVic ramen Oe Mo 
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where E is in eV and mz is in eV/c?. The velocity of a particle is given by Eq. 
(3-2-34): 


pe | (3-2-37) 


where p is in eV/c and E is in eV. Although this system of units will seem 
odd to the student at first, he will soon become accustomed to the great 
advantage of not having to carry around c’s everywhere. 

Let us illustrate the methods discussed above by calculating the velocity 
of a proton which has acquired a kinetic energy of 500 MeV (a MeV is 
10° eV). Referring to Table 3-1 we see that the mass of a proton is 938 
MeV/c?. Hence its rest energy is 938 MeV. Adding this to the kinetic energy, 
we obtain 


E = 938 MeV + 500 MeV = 1438 MeV 
We next find the momentum p. 
p = ./(1438)? — (938)? = 1090 MeV/c 


Finally we determine the velocity of the proton relative to the speed of light: 


TABLE 3-1 Tabulation of the masses of some 
elementary particlest 


Particle Symbol Mass (in MeV/c’) 
Electron et 0.511006 + 0.000002 
Muon pres 105.659 + 0.002 
Charged pion To 139.578 + 0.013 
Neutral pion n° 134.974 + 0.013 
Charged kaon Ka 493.82 +011 
Neutral kaon K° 497.76 + 0.16 
Eta n° 548.8 + 0.6 
Proton p 938.256 + 0.005 
Neutron n 939.550 + 0.005 
Lambda A° BES Sy) + 0.07 
Sigma plus Dy 1189.43 + 0.17 
Sigma zero =° 1192.55 + 0.11 
Sigma minus =a 1197.42 + 0.09 
Xi zero =° 1314.7 + 0.7 
Xi minus =" 1321.25 + 0.18 
Omega minus Qr 1672.4 + 0.6 


+From compilation of Prof. A. Rosenfeld. 
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3-3 CHARGE DENSITY AND CURRENT DENSITY AS 
COMPONENTS OF A FOUR- VECTOR 


We remember from elementary physics that current is somehow related 
to the movement of charge. In this section we will put this notion on a 
quantitative basis and then show that current density and charge density 
are closely related as components of a four-vector. This is not altogether 
surprising for we anticipate that stationary charge density as seen from one 
reference frame will appear to be moving in another reference frame. 

It is simplest to begin with a collection of discrete charges which are 
sufficiently closely spaced so as to be effectively a continuum. We will assume 
that the charge q; has a velocity v,, and we then define the current density j 
at a given point by the equation 


Lavi 
jeu = lim —— (3-3-1) 


Aro AT 
where Art is a small volume element about the point of interest. (We have 
attached the subscript esu to indicate that the current density so defined 


is in electrostatic units.) Of course the charge density in this case is defined 
as before: 


4 


p = lim a= G22) 
Ar70 


This definition of current density is quite in keeping with our intuition. 
If we want to determine the charge per unit time flowing past a bit of surface 
dA in the direction. of the normal vector fil, we just evaluate j,,,.° fi dA. 
The flux of j out of a given volume is just the charge leaving it per unit time. 


Rate at which charge is leaving AV = | Jesu * dA (3-3-3) 
surface 
of AV 
We now make a most important empirical observation. The total 
charge that a given object carries is an invariant with respect to a Lorentz 
transformation. Were this not the case we would have a very insane world 
indeed. As we heated a bit of matter, the average electron velocity would 
increase quite differently from the average nuclear velocity. Hence, if the 
negative charges of the electrons just canceled the positive nuclear charges 
at one temperature, they would not do so at all temperatures. The body 
would then change its overall apparent charge with changing temperature, 
and, remembering the basic strength of the electrostatic force (see page 34), 
disastrous consequences would follow. Fortunately, when the laws of 
physics were first set down, this problem was averted through the Lorentz 
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invariance of total charge. (As we have seen, this invariance does not hold 
true for the total mass of an object. Its mass does increase as it speeds up. 
However, because the net gravitational force on matter is not the resultant 
of a delicate balance of opposites as it is in the case of electrostatics, the 
consequences are negligible.) 

We now ask how j,,,, and p transform under a Lorentz transformation. 
Since not all our charges are necessarily moving with the identical velocities, 
we must break our charge and current distribution into subdistributions. 
Each of these subdistributions will consist of all charges that do have more 
or less identical velocities. We can treat each of these subdistributions 
independently as we go from one frame of reference to another and add 
them together at the end. 

For the typical case we have j.,, = pv where v is the velocity of the 
subdistribution. We transform into the system X’ moving with velocity v, 
in which case 


p= (3-3-4) 
p’ = Po = charge density in rest frame of subdistribution 


When we transform back to the original system 2, all distances along the 
transformation direction are compressed by the factor ./1 — v?/c?, and, 
in order that the total charge remain constant, the charge density must 
increase by the factor 1/./1 — v?/c?. Hence in the £ system we have 


Po 
Tp 
wid ele? 


Since j.,, = pv in this system, we also have 


_——— Pov 
Jesu = Jl — ee (3-3-6) 


We notice that these equations are.completely identical in velocity depen- 
dence with the previously derived equations for mass and momentum: 


(3-3-5) 


Mo 
m= 
1 — v*/c? 
MoV 
p — 
1 — v?/c? 


But the four quantities p,, p,, p,, and ime transform as a four-vector under 
Lorentz transformation. The conclusion is clear. The four quantities 
Jxtesu)> Jesu)» Je(esu)s 4o¢ Must transform as a four-vector. 

At this point it is convenient to make a change in our units. We intro- 
duce the so-called gaussian units, wherein p is expressed in electrostatic 
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units as before but j is expressed in electromagnetic units. The definition of 
the electromagnetic unit is 


Jeni = (3-3-7) 
c 
We shall no longer carry the subscript emu or esu but tacitly assume that 
j = Jjemu- The four quantities /,, j,, j,, and ip constitute our new four-vector. 
Consider now how we would express conservation of charge within a 
given coordinate system. The rate of change of charge in a volume is equal 
to the charge entering the volume per unit time. Using Gauss’ law, we can 
write this as 


vj — — 0 (3-3-8) 


Note then that the left side of the equation is just the scalar product of the 
two four-vectors [(0/dx, 6/dy, 0/8z, 1/ic(d/ét)] and (j,, j,, j,, ip) and is thus 
invariant under Lorentz transformation. If the conservation of charge 
holds in one coordinate system, it will hold in any equivalent system! 


3-4 THERE MUST BE A “MAGNETIC FIELD’! (THE REQUIREMENT 
OF LORENTZ INVARIANCE IMPLIES A VECTOR POTENTIAL) 


Until now our study of relativity has been largely limited to kinematics. 
That is to say, we have learned how to transform some of the variables that 
describe a dynamical system from one coordinate system to another, but 
we have not discussed at all the laws of physics which govern the behavior 
of these variables. We will now begin to do so with a very simple assumption. 
We will assume that the force on a charged particle in its own rest frame is 
given by the electric field as the particle sees it. Furthermore, we will assume 
that if the charge and current distributions that it sees in its own rest frame 
are not changing with time, then the electric field it sees will be determined, 
just as in the case of simple electrostatics, by Eq. (2-1-4). 

The consequences of these innocent-sounding assumptions are abso- 
lutely fantastic. We will now show that the electric field cannot provide a 
complete description of the long-range forces resulting from charges and 
currents, but that there must in addition be other field quantities at each 
point in space which exert a velocity-dependent force on a moving charged 
particle. Thus we will deduce that there must be a magnetic type of force. 
Later we shall see how to express this force in terms of magnetic field. 

We begin with an exceedingly simple problem. We consider an in- 
finitely long cylindrical region of radius R which carries a constant current 
density j but no charge density (p = 0). We can achieve this by having an 
equal density of positive and negative charges moving in opposite directions. 
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A charge q is now projected with velocity v parallel to the current, at a 
distance r from its axis. We wish to find the force exerted on the charge 
(see Fig. 3-4). 

We first note that nothing we have done in electrostatics would seem 
to indicate that there should be any force whatsoever on the charge q. 
Any force we discover now will be new. Of course, we anticipate from 
previous experience that there should be a “‘magnetic” force on q, but 
as we will now show, this state of affairs is completely deducible from 
electrostatics and the Lorentz transformation. 

We begin by doing the obvious, jumping to a system in which the 
charge is at rest. In this system L’ moving with velocity v with respect to 
the laboratory £, we see the current density j’ and the charge density p’ 
given by 


. , P 
Jy FY EF oa 
enue ce 
ip’ = —iByj, 


That is to say, we see a negative charge density for r < R equal to 


1 a Uj, 
p= ——— 3-4-2 
./ = wale? ! 
The electric field set up at the radius r is radial in direction and is just (from 
Gauss’ law) 
2nR*p’ 
(ae —— (3-4-3) 
If we let J = 1R?j, = total current, we have 


: 2Iv é 
| i (3-4-4) 


c./1 — v?/e?r 


Fig. 3-4 Acharge q is projected with velocity v parallel to a current distribution j. Forr S R, 
j = constant, p = 0. Forr > R,j = 0,p = 0. 
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Now, the force on our charge q is just gE, in this system. But this force is 
also equal to dp;/dt in the Z’ system where 1 is the particle’s proper time 
and p, is its transverse momentum. Hence 


dp, non 2Ivq 


= SSS Se 3-4-5 
dt c./1 — v?/e?r \ ) 
Going back to Z, we note that 
dp, = dp, 
and 
io aa. (3-4-6) 
1 — v7/c? 
Substituting back, we find the force on q in the laboratory. 
dp 2Ivq 
ee, oes Be 
ae ai cr Cg) 


As we see, this is just equal to what we would have predicted from magneto- 
statics, as we learned it in elementary physics. We would have said 

F,= -q = B and B= ~ 

So we have made a major discovery. A charged particle moving in 
the proximity of a current distribution has a force acting on it, even though 
there is no other charge density present! Unfortunately though, life is not 
quite so elementary, for if we try to project the charge q in the y direction 
(toward the axis of the current distribution), we run into trouble with our 
simple reasoning. If we jumped on board our particle, we would see no 
apparent change in the current density and we would see no charge density 
at all. This is because the current j is now transverse to our velocity of trans- 
formation and there is no longitudinal current or charge density in the 
laboratory. What do we do now? Do we conclude that the charge gq will 
now have no force on it? Of course not! We have gone outside the area of 
validity of electrostatics because we now see a current distribution which is 
changing with time. It is approaching us with velocity —v, and we have no 
right to expect that electrostatics will give us the full answer. So we must now 
go back and think out our whole problem from scratch. 

To feel our way, let us recall the equation relating potential to charge 
distribution: V9 = —4zp. It is clear that this equation is not properly 
invariant under a Lorentz transformation. To make it so, the operator V? 
has to be replaced by something which transforms as a scalar in four- 
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dimensional space. Obviously we want 
‘il C A ae [aes 
Ox? Ope ae? Ce 

The symbol on the right, called the D’Alembertian, reduces to the laplacian 


V2 in the event that there is no time dependence to ¢ and has the right invari- 
ance under Lorentz transformation. The proper equation for ¢ is then 


De = —4np (3-4-9) 
Since p is the fourth component of a four-vector, g must also be so. Hence 


we must have three other components A,, A,, and A, such that the set 
of objects (A,, A,, A,, ig) is a four-vector and 


=] (3-4-8) 


OA, = —4nj, (3-4-10) 
OA, = —4nj, (3-4-11) 
OA, = —4nj, (3-4-12) 


In the event that j and p are not time dependent, we can rewrite these equa- 
tions in integral form just as in electrostatics. 


p(t’) 
= pat gy" 3-4-13 
Pp {, Ir = r’ ( ) 
space 
A | IO ay (3-4-14) 
all Ir or 


In summary then, the laws of electrostatics must be generalized by intro- 

ducing a four-vector potential A, corresponding to the four-vector current 
j, such that 
4 792 

OA, 

v=1 dx? 


= —4nj, (3-4-15) 


This equation represents our first breakthrough in understanding 
how to generalize from electrostatics. We see that one component of poten- 
tial is inadequate but that we must have four components in order that our 
old equations for electrostatics not be limited to one particular coordinate 
system. The Eqs. (3-4-15) have the property that if they hold for one system 
they will hold for al// systems and reduce properly to the equation of electro- 
statics in the event that there is no time dependence in any given system. 


3-5 THE ELECTRIC AND MAGNETIC FIELDS AS ELEMENTS 
OF A SECOND-RANK TENSOR 


Our next step is to find the fields that exist at each point in space in terms 
of the four components of vector potential. Again we use what we have 


3-5 THE FIELDS AS ELEMENTS OF A TENSOR 127 


learned in electrostatics to point the way for us. We recall that in the absence 


of time dependence we have E = —Vg, which we can rewrite as 
es 
—iE, = a (3-5-1) 
as 


It is clear then that the three components of electric field are really elements 
of a second-rank tensor in our four-dimensional space. Our job is now 
reduced to finding the complete tensor. 

So far everything we have done has been entirely deductive, making 
use only of Coulomb’s law, conservation of charge under Lorentz trans- 
formation, and the requirement of Lorentz invariance for our physical laws. 
We have now come to the end of this deductive path. At this point when the 
laws were being written, God had to make a decision. In general there are 
16 components to a second-rank tensor in four dimensions. However, 
in analogy to three dimensions we can make a major simplification by 
choosing the completely antisymmetric tensor to represent our field quan- 
tities. Then we would have only 6 independent components instead of the 
possible 16. Under Lorentz transformation the tensor would remain anti- 
symmetric and we would never have need for more than six independent 
components. Appreciating this and having a deep aversion to useless 
complication, God naturally chose’ the antisymmetric tensor as His medium 
of expression. We define this tensor F,,, as follows: 


— 2 . 24 ei 


OX, = OX, 


1It has been pointed out to the author by Prof. D. Dorfan that God was actually quite con- 

strained in His choice of the second-rank tensor. The demonstration that the tensor we are 
looking for (which we shall call F) must be totally antisymmetric goes as follows. We first 
observe that in the instantaneous rest frame of a particle of charge q the force on it must be 
equal to gE where E is the electric field it sees. The force is also equal to the rate of change 
of the particle’s momentum p in this frame of reference. Hence we have, making use of 
Eq. (3-5-1), 


a=, for f = 1,.2,3 (3-5-1a) 


Saar. ‘fort = 1, 263) 4 (3-5-1) 


Next we observe that the momentum four-vector in the particle’s instantaneous rest frame is 
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Writing out the terms explicitly, we have 


eer {eo 1} ro 

0 (V x A), —(V x A), i cn Ae 
ome [0 1 ar) 

2(V x A). 0 (V x A) ey ae 
r ‘ Fr (2% 1 2) 

(V x A), —(V x A), = ae 

6p 1 OA (do 1 9) (2 1 =.) 
—i{— + — —) -{ —+— —] -1—+— 0 
(2 + ( “) (2 a c Ot : oz c Ot 
(3-5-3) 


where A is the three-vector (A,,A,,A,). Notice now the remarkable fact 
that under a pure rotation the tensor F breaks into a polar vector and an 
axial vector. The components (F3, — F,3,/,2) are really the three compo- 
nents of V x A and transform as an axial vector. The three components 


1 OA aa 
(F,4,fo4,F34) are clearly equivalent to (Vo + es *) which is a polar 
vector. Hence in any given coordinate system it would appear as though 
there were two independent vectorial fields at each point in space. Only 


Pu = (0,0,0, imc) (3-5-1c) 


Thus Eq. (3-5-15) can be rewritten in the form 


dp, 

ae VF, 3-5-1 

dt = moc - Prt, ea) 
Both sides of Eq. (3-5-1d) now have the appearance of components of a vector. We next take 
the scalar product of this vector with the vector 2p: 


4 dp 4 
2,—=—2 > PVE Py : (3-5-1e) 


in" 
eal dt Mol y=1 p=1 


We observe that the left side of the equation can be rewritten as 


4 4 
We thus have 
4 4 
Ly »Y pA.p, = 0 (3-5-If) 


v=1 yn=1 


The left side of Eq. (3-5-1f) is a scalar. Hence the equation is true in any coordinate system 
which can be related to our instantaneous rest system by means of a Lorentz transformation. 
It is now left to the student to prove that if Eq. (3-5-1/) is to be true for any choice of system 
then F must be antisymmetric. 
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when we transform between them do the two fields become mixed together. 
In any case, let us return to Eq. (3-5-3) and make some important 


: ; 1 OA 
observations. The first of these is that Vo and — ae 
c 
Component by component they always occur together, and so a particle 
must experience their combination as though it were one type of force. 
Inasmuch as we identified —Vg as E in the case of electrostatics, we now 
write 


are inseparable! 


E = —Vo —- — — -5-4 
0 (3-5-4) 
(This result embodies all of Faraday’s famous law and is even more general. 
Its importance will become apparent in due course.) 
We now make a definition. We define the magnetic field B as 


B=VxA (3-5-5) 
We have then 
OF) +8, — 8, 12 
—B, 0 +B, —i, 
oes (3-5-6) 
+B, —B, ‘0 —iE, 
iE, 0 
Before we actually see how this magnetic field acts on a moving 
charge, we should examine in detail the behavior of the components of 
F as we go from one coordinate system to another through a Lorentz 


transformation. We will assume as usual that 2’ is moving along the positive 
x axis of © with velocity V. As usual we have 


4 
r= 5 eee, (3-5-7) 
a,p=1 


Remembering that the Lorentz transformation matrix is given by Eq. 
(3-2-10), we have 


0 Me BE) Ber Fey) a i 
a gt Bree PE,) 0 B, —iy(E, — BB,) 
B+ BE) =&, 0 —iv(E, + BB,) 
iE, in(E, — BB.) iE, + BB,) 0 
(3-5-8) 


where B = V/cand y = 1/,/1 — B?. 
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We first observe that the component of either E or B along the direc- 
tion of relative motion remains unchanged. This component is called the 
longitudinal component and in the present case is the component along the 
x direction. ; 

B, = B, 


EL = E, 


(3-5-9) 


As far as the transverse components of E are concerned, we note that 


E’ 


y 


V 
= yE, soak), a B, 
eee =v x B), (3-5-10) 


E, = yE, + —(V  B), 


Hence we can write in general, for the component of E transverse to the 
relative motion of 2 and x’, 


Vv 
E; = i| Es + (~ x 8), | 


However, since (V/c) x B has no longitudinal component, we can simplify 
this to 


E, = i| Es e (~ x B) | (3-5-11) 


Similarly we have 
: Vv 
B; = (3; = x E) (3-5-12) 


We can now easily find the force on a moving charge gq moving with 
velocity V in the presence of electric and magnetic fields. We need merely 
transform to the rest system of the particle, set the force on the particle in 
this system equal to gE’ = dp’/dt, and then go back to the original system, 
transforming dp’/dt as we change. (t as we remember is time as measured 
in the particle’s rest system.) We have for the transverse components 

= = (Er a x B) (3-5-13) 


But p; = Pr. since the transverse components of p are unchanged by a 
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Lorentz transformation. Also, from time dilation we have t = yt. Hence 


apr Vv 
a ed — OB (3-5-14) 


Turning to the longitudinal component of E, we have 


a oy 
or 
d / 
i = Lo GB (3-5-15) 


But, making use of the Lorentz transformation and remembering that >’ 
is the rest system of the particle, we have 


Pr = Yi + Bymoc 
Therefore dp, = y dp; and hence 


WL —_ gk. (3-5-16) 


Combining this with our previous result for the transverse components of 
the force, we have 
Vv 

® P= gE+g~xB (3-5-17) 

This rather fundamental equation tells us precisely how electric and 
magnetic fields act on a moving charged particle. We can now, in principle, 
solve any dynamics problem involving the interaction of charged particles 
and currents if we can only find a procedure for determining these fields. 
We already know how to determine E from a static charge distribution. 
We also know how to evaluate A(r) if there is no time dependence to our 
currents, and as a result we know implicitly how to determine the magnetic 
fields which derive from static currents. In the chapters ahead we shall 
develop explicitly many of the techniques for determining the fields as a 
function of time and space. Before doing so, however, we must derive 
Maxwell’s equations in all their beauty. 


3-6 MAXWELL’S EQUATIONS 


Our derivation of Maxwell’s equations will again make use of the require- 
ment of Lorentz invariance. We will begin with the old electrostatic equation 
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V - E = 4np and ask what we must do in order that it hold in all systems. 
This will lead immediately to an equation relating the magnetic field B 
to the current density j. 

Let us first write V- E = 4zp in relativistic notation. Remembering 
that F,, = 0, we can rewrite V - E as 


CE, | Chee, 
~ Ox oy Oz 


_ ,( 2Fia , Fae , OFse 
NON Ox 0x3 


V-E 


= —iAnj, (3-6-1) 


But, we recall from Eqs. (1-6-6) to (1-6-11) that the product of a 
second-rank tensor and a vector transforms as a vector. Hence V- E, 
as expected, is the fourth component of a vector, and we can now insist that 
Eq. (3-6-1) be generalized to the other components: 


Sar OF, 

Ly a = —4n, (3-6-2) 
pel 6 be 

Let us develop this explicitly for the first three components: 


Bees. Wee, 


ae oe pe 

6B, OB, 1 @E, ' 

oe Gp | meen es) 
6B, OB, 1 OE, 

Say ea 


Combining all three equations, we have 


I : 
a ee (3-6-4) 


The fourth component, to repeat, gave us 

2. V-E=4n9 (3-6-5) 
Since B = V x A and the divergence of any curl is zero (see page 21), we 
have 


3. V:B=0 (3-6-6) 
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And finally, from E = —Vo — a = we have 
c 


VxE=- ea v« ss 
c ot 
Inverting the order of differentiation, we get 
1 6B 
a OV «ES 6. 
i (3-6-7) 
These then are the famous Maxwell equations. Note that the first 
and second of these are completely equivalent to but not nearly as aestheti- 
cally pleasing as Eq. (3-6-2). We would like to put the last two Maxwell 
equations in an equally beautiful form, but first we must introduce the so- 
called completely antisymmetrical tensor of the fourth rank, ¢,,,,, defined 
as follows: 


Evvap = 0 if any two indices are equal 

E1234 = | 

Ewvap = 1 if u,v,A, and p are all different and can be obtained 
from 1,2,3, and 4 by an even number of transpositions 

ao =i if u,v,A, and p are all different and can be obtained 


from 1,2,3, and 4 by an odd number of transpositions 


We shall make use of two properties of Exvaps the demonstration of which 
we will leave to the reader. 


1. The components of ¢ transform into themselves under Lorentz 
transformation. 


2. The sum )’ ¢,,,,B,, transforms as an element of a second-rank tensor 
Ap 


vAp 


under Lorentz transformation if B,, is an element of a second-rank 
tensor. Making use of this, we define the tensor G by the equation 


i: 
Gay = zi Y Seal (3-6-8) 


where F,, is an element of our field tensor. 
The tensor G is antisymmetric and has elements 


ale 

Gyr = Zi(E1234F 34 + €1243%43) 
lg 
pele 

Gi3 = gi(€y324Foq + €1342F42) 
matty 
2 les 

Gig = 21(E,423Fo3 + €1432F32) 
= iFj3 
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pee, 
Go3 = Si(€r314Fig + €2341F41) 


= iFy4 
Gra = Fi(E2413Fi3 + 2431 F531) 
= —if, 
G34 = ZilEsa12Mi2 + 3421721) 
= iF, 
Summarizing, we have 
0 E, —E, iB, 
—E, 0 ‘Ee iB 
G = : (3-6-9) 
E, —E, 0 iB, 


—iB, —iB,. —iB, 0 
Now, since B = V x A, wecan write 
OG 4 


a 


Ht Xp 


=i{¥-B— 0 
For this to prevail in any coordinate system, we must have 


ys oe 0 forall v (3-6-10) 
OX, 


1 6B 
When we set v = 1, 2, 3, we just come up with V x E = — ae Hence 


the four Maxwell equations can be rewritten as 


(3-6-11) 


From the purely aesthetic point of view there is something wrong 
with these equations. Whereas the electric field can have a static source, 
namely, p, the magnetic field cannot. Wouldn’t it be nice if somehow there 
were magnetic monopoles possible in nature so that the two equations would 
be “symmetric”? Suppose for a moment that such objects did exist. We 
could then define a so-called magnetic charge density p” and a corresponding 
magnetic current density j*”. (The magnetic current density would actually 
be an axial three-vector field.) There would be a corresponding potential 
four-vector A," of which the first three components would form an axial 
three-vector. Finally, we would rewrite the second Eq. (3-6-11) as 


uy any (3-6-12) 
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i current 


Fig. 3-5 A current of mag- 
netic monopoles is circulating 
around a closed loop as shown. 
We evaluate $E - dl around the 
curve C which links the mono- 
pole current. 


Now the possibility that magnetic monopoles exist has actually 
excited wide interest for another reason. It was pointed out by Prof. P. A. M. 
Dirac that their existence would explain the quantization of charge in a 
natural way.’ This being the case a considerable amount of effort has been 
devoted to finding these monopoles. The efforts have been of two different 
sorts. 


1. The fact that monopoles would be strongly acted upon by magnetic 
fields has been exploited by a number of experiments which have tried 
to pull them out of matter or out of the cosmic radiation by means 
of such fields.” 

2. Before we describe the second type of experiment we must put Eq. 
(3-6-12) into more convenient form. If we let y = 1,2, and 3, we obtain 


VxE= = —— = 4nj™ (3-6-13) 


Let us next imagine that we have a steady continuous current of mono- 
poles circulating around a closed loop as shown in Fig. 3-5. Nothing is 
changing with time, and so we are justified in removing the time- 
dependent term on the right side of Eq. (3-6-13). If the total magnetic 
“charge” in the loop is QO and the time it takes for a given monopole 
to make it around is 7, then the monopole current in the loop is just 


(m) 
ym - 2" 
CHE 


[The factor c is required because our four-vector is (j,”,j,""".j,, ip"). 
That is, p“ and j* are measured in different units in complete analogy 
with p and j (see page 123). ] 

Let us next integrate E- dl around an imaginary closed curve C 


'For a review of this subject in reasonably simple terms the reader is referred to an article by 
J. Schwinger in Science, 165: 757 (Aug. 22, 1969). 


2See, for example, E. M. Purcell, G. B. Collins, T. Fujii, J. Hornbostel, and F. Turkot, Phys. 
Rev., 129: 2326 (1963). 
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linking the loop. Using Stokes’ theorem, we convert the line integral 
into a surface integral over the area bounded by C. 


$e-a= | V x E-fidA 
c A 


= in| j’-adAa 
A 
= 4x7 


(m 
tots (3-6-14) 
Co (i 
If a loop of wire were thus allowed to link the monopole current loop, 
then Eq. (3-6-14) would ensure that a current would flow in the loop. 
Indeed, it would be as though a battery with a potential difference of 
4nQ"/cT statvolts were placed in the wire loop. 


This brings us then to the description of an actual experiment carried 
out by Prof. L. W. Alvarez! and collaborators at the University of California 
Lawrence Radiation Laboratory. They were interested in determining 
whether any monopoles might be trapped in pieces of moon rock brought 
back by the astronauts. After all, the moon’s surface is being continuously 
bombarded by cosmic rays, and an occasional one might be a monopole. 
A small bottle of moon rock was placed on a track, as shown in Fig. 3-6, 
and made to go around and around with a time for | revolution of about 4 
sec. Linking the track was a superconducting niobium coil with 1200 
turns of wire. (A superconducting coil has no measurable resistance.) If 
there were a monopole trapped in the rock, then the electric current through 
the superconducting loop would change as a function of time. The rate of 
change of electric current will depend on the line integral of Eq. (3-6-14) 
as well as on the inductance of the coil, a property which we will discuss in 
due time. Suffice it to say that no monopole has yet been seen. We will return 
to this experiment in Sec. 5-1, after we have developed some more formal 
machinery, in order that we may be able to assess its true sensitivity. We 
will see at that time that it is capable of detecting the smallest quantum of 
magnetic charge consistent with Dirac’s theory. 

In any case, since no monopoles have as yet been seen, we will continue 
to write Maxwell’s equations in the conventional form given by Eq. (3-6-11). 
Perhaps at some future time we will have need to change them. 

We have now completed the foundation of our electromagnetic 
edifice. In the chapters ahead we will undertake an examination of the 
incredible richness of Maxwell’s equations, culminating in the discovery that 


'L. W. Alvarez et al., Science, 167 (3917): 701-703 (1970). 
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| Sammele path } ple path and superconducting 
\ tT / coil used by Prof. L. W. Alvarez 
ae J and collaborators to search for 
—— a ee ee ene monopoles in moon rocks. 


accelerating charges must radiate light. At no point should we lose sight 
though of the very simple assumptions that have gone into our derivation 
of this theory. Basically all we have used are Coulomb’s law, the requirement 
of relativistic invariance, and a passion for simplicity and elegance. Remark- 
able indeed! 


PROBLEMS 


3-1. 


3-3. 


3-4. 


A neutral pion decays in about 10~!° sec into two photons (y rays). Neutral 
pions can be produced in the laboratory by stopping negative pions in hydrogen 
and allowing them to be captured into atomic orbits about individual protons. 
When they reach the ground state of the atom, the negative pions are captured 
and the system undergoes the reaction 


kK +p7n +n 

(a) Ignoring the binding energy of the x in the atom before its capture, find the 
velocity of the x° emerging from the reaction. 

(b) What is the kinetic energy of the emerging neutron? 

(c) How far does the z° travel in the laboratory if it lives for a time of 107! sec 
as measured in its own rest system? 

(d) What is the observed spectrum of gamma rays emerging from the hydrogen 
target? 

An antiproton (f) can be produced by firing energetic protons at other protons 

which are stationary in the laboratory. The reaction which takes place is 


PEP Spt Pp + p+ p 


Find the threshold for this reaction. That is, find the energy of the least-energetic 
incoming proton that can initiate the reaction. 


A A° particle lives for a time of 2.4 x 107 1° sec in its own frame of reference. 
How far does it travel in the laboratory if it is moving with velocity 0.5c, 0.9c, 
0.99¢? 


An electron traveling in a straight line along the x axis moves by a stationary 
observer as shown. The observer is located on the z axis with coordinates 
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Zz 


Observer a 
“se Electron 


(0,0,1). Find and graph the z component of electric field seen by the observer 
for each of the following electron velocities: 0.6c, 0.8c, 0.9c, 0.99c. 


3-5. Show that the components of ¢,,,, when transformed under a Lorentz trans- 
formation remain unaltered. 


3-6. Show that if B is a second-rank tensor then )” ¢,,,,B,, is an element of a second- 
rank tensor. ae 


3-7. Show that E? — B? and B- E are each invariant under Lorentz transformation. 


3-8. Let E and B be normal to each other and let |E| < |B]. Find the velocity v 
of a system within which E = 0. 


3-9. Consider a classical electron in a circular Orbit of radius R about a proton. 
What is the magnetic field seen by an observer whose velocity coincides instan- 
taneously with that of the electron? 


3-10. Show that if we alter both A and ¢ by the transformation 


AOS A ay 
a 1 op 
Na (Qe 


where w is any function of space and time, then the electric and magnetic fields 
are unaltered. This transformation is called a gauge transformation. 


3-11. Consider a single magnetic monopole with unit magnetic “‘charge”’ traveling in 
a straight line down the x axis. An observer stationed at coordinate (0,0,1) 
on the z axis will observe a pulse of efectric field as the monopole goes by. 

(a) Plot the magnitude of the y component of this field as a function of time 
for each of the following choices of monopole velocity: B = 0.6, B = 0.8, 
B= 0.9, B = 0.99. 

(b) Plot the x and z components of the electric field for the same set of velocities. 


4 
Time-Independent 
Current Distributions; 
Miagnetostatics 


In this chapter we will develop a variety of techniques for determining the 
magnetic field B which arises as the result of a time-independent current 
distribution. We will then consider the dynamics of a moving charged 
particle or a current loop within this field with a number of interesting and 
practical results. 

Basically there are three different ways in which we might approach 
the question of determining B as a function of position in space. We might 
attempt to find the vector potential A first, making use of either the differen- 
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tial equations (3-4-10) to (3-4-12) or the integral equation (3-4-14). We 
would then find B by determining V x A. Alternatively we might begin 
with Eq. (3-4-14) in its general form, take its curl, and come up with a 
generally useful integral for B directly. Finally we might see if there are 
any simple integrals for B which might be useful in cases of high symmetry. 
As we shall see, each of these techniques is quite analogous to a corre- 
sponding technique that we have learned in electrostatics. Before we begin 
on this path, however, it is useful to say just a few words about the behavior 
of currents in typical conductors. We will develop the approximation known 
as Ohm’s law. 


4-1 AN ELEMENTARY DERIVATION OF OHM’S LAW 


In most metals there is about one electron per atom that is relatively free 
to move about, being subject only to macroscopic electric fields and scattering 
on the individual lattice sites. The average velocity of an electron (0) is 
of the order of 108 cm/sec and, inasmuch as the typical mean free path is 
of the order of 10~° cm, there is very little change in the velocity due to 
any reasonable applied field in the time between collisions. Let 1 be the 
mean free path. Then the time between collisions is At = 4/v. The average 
drift velocity gained in this time interval due to an applied field E is 


| (eE | ek A 
= (441) - a (4-1-1) 


After a collision it is assumed that the velocity of the electron is randomized 
in direction, and so the average induced current density is 
_ Ne?EA 
= ——_ emu 
2mve 
where N is the number of “‘free” oe rone per cubic centimeter. We write 
this vectorially as 


j= oE (4-1-2) 
where 
Ne?1 
~ Imbe eal) 


is called the conductivity. (In actuality the drift speed is some 10 orders of 
magnitude lower than 5. Hence the simple approximation we made, that 
At = A/t, is quite reasonable.) 

This simple proportionality between the applied field and the current 
density is called Ohm’s law. It is most often applied to the case of a given 
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piece of conductor of area A and length /. In that situation we see that the 
total current is J = jA and the potential difference Ag across the conductor 
is given by E/. Ohm’s law then says that 


A 
1 oA 
Hence 
Age . tf | 
[.c1 * (4-1-4) 


where R is called the resistance. 

We can calculate the work done per unit time on the current j by the 
field E. The work done per unit time on a charge gq is just F-v = gE-v. 
Hence the work done on the current per unit volume per unit time is just 


dw 
7 ea E = jew E = cj:E (4-1-5) 


In the event that j = oE, we can write 


dw 

dt 

We should comment at this point that the system of units we are using 
here for R and g is such as to have the current always measured in abamperes 
and the potential in statvolts. The unit of resistance is thus the statvolt per 
abampere. Most texts when dealing with resistance will stay entirely with 
either esu or emu units. We have chosen to ignore that convention for the 
sake of overall consistency. Inasmuch as most resistors are labeled in ohms 
anyway, we need only know that 


= col (4-1-6) 


1 statvolt/abampere = 29.98 ohms 


4-2, FINDING THE MAGNETIC FIELD THROUGH THE VECTOR POTENTIAL 


Finding the vector potential A in magnetostatics is quite analogous with 
finding the potential @ in electrostatics. We can either carry out the integral 


A(r) = | ee 


or we can search for the unique solutions to V?A; = —4zj; subject to the 
appropriate boundary conditions. These methods will be most simply 
applied if we can make a direct analogy to an electrostatic problem that we 
solve easily and then adapt the solution appropriately. We will illustrate 
by application to two simple examples. 
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The first problem we solve is to find the magnetic field everywhere 
due to an infinite cylindrical uniform current distribution of radius R 
(see Fig. 4-1). We take the origin of our coordinate system on the axis of the 
cylinder and let the direction of current flow be along the z coordinate axis. 
The current density within the cylinder is j. 

The analogous problem is, of course, that of a uniform cylindrical 
charge distribution. We remember, making use of Gauss’ law, that the 
electric field due to such a distribution is radial and has a magnitude £, 
at a distance r from the axis given by 


2npr forr< R 
E, = 2npR? 
r 


(4-2-1) 
forr>R 


If we try to calculate the potential in the usual way by integrating out to 
infinity, we will come up with an infinite result because we are using an 
infinite cylinder. Since in the end it will be the fields and not the potentials 
that interest us, let us renormalize so as to have gy = 0 at r = 0. We have 
then 


6) = = | “Eee 
0 


Fig. 4-1 We find the field around a cylinder of 
uniform current density. 
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This yields 
— pr? forr<R 
g(r) = + ; ; (4-2-2) 
—7pR 1+ 2in- forr > R 


Now back to the magnetostatic problem that really interests us. 
Since there is only a z component to the current, we have A, = A, = 0. 
We also have 


Ar) = | vAG ) ahiee 
all |r el 
space 


Our analogy tells us that 


—njr? forr< R 
(4-2-3) 


A,r) = iG 
—njR? 1+ 2In> forr>R 


Finally, we remember how to take the curl in cylindrical coordinates [see 
Eq. (2-9-16)]. We have in this case 


VxA=- = 6 (4-2-4) 
Thus 
2njr 6 forr < R 
B= oni? , (4-2-5) 
: 0 forr>R 


As we shall soon see, this result could have been obtained much more 
rapidly through more sophisticated means (Ampere’s law). Nevertheless 
it does serve as a useful first example. 

As our second example we consider an infinite sheet of current having 
a thickness ¢ and a constant current density. We set our coordinate system 
up with the x axis perpendicular to the sheet and the z axis along the current 
(see Fig. 4-2). The current density is then j = jk where k is a unit vector 
in the z direction. 

Again the vector potential is given by 


i(r’) dv’ «leg 
A(r) = | eka | k (4-2-6) 
sheet sheet 


rr F-F| 


The corresponding electrostatic problem would be a sheet with uniform 
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Fig. 4-2 We find the field due to an infinite sheet of uniform 
current density j and thickness ?. 


charge density p, and we would have had 


dV’ 
p(t) = p \ Te i (4-2-7) 
sheet |r al | 
We solve the electrostatic problem by means of Gauss’ law, obtaining 
2npti forx = + 
t t 
E = < 4apxi for — Dh <a =F (4-2-8) 
x t 
—2npti forx < —- ~— 
Z 
If we set 9 = Oat x = 0, we have 
2 
5 — 2nptx for x= 
t t 
@ =4 —2npx for — 5 SX > (4-2-9) 
npt? t 


5 + 2nptx forx < —. 
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Hence, by complete analogy, we can write down A for our original problem: 


nt? fe t 
Eee.) > 
( 5) nix fOr x= 5 


we) ee, t t 
An= 2njx*k for — mr Sa ae (4-2-10) 
nt? i t 
ede : een 
( 5 + ant forexes 5 
Finally, taking the curl of A, we have 
: t 
2ntjj for x = 55 
4 t t 
B= ¢ 4njxj for — a SY S > (4-2-11) 


- t 
—2ntjj forx S — os 


where j is the unit vector in the y direction (not to be confused with current 
density). 

As a rule we will find the techniques of this section to be of marginal 
utility because it will normally be no harder to calculate the field directly 
than it will be to go through the vector potential. Furthermore, there is no 
situation analogous to a set of conductors with fixed potentials to determine 
our boundary conditions. Nevertheless, when all else fails, this method 
may provide a path to the solution we seek. 


4-3 THE BIOT-SAVART LAW 


Making use of our expression for A(r), Eq. (3-4-14), we can proceed directly 
to find a general expression for B(r). 


Bir) = V x A(r) 
-vx | ja) av’ 


all |r = r’| 


“ | ify x @ -¥) (4-3-1) 


all |r aE 
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This result is called the Biot-Savart law. It is a simple prescription for 
finding the magnetic field B at any point r by integrating over the current 
distribution. 

We often deal with situations where the actual current distribution 
within a wire itself is not important and can be averaged over. This situation 
occurs when the thickness of the wire is small compared with our distance 
from it. In these cases we can integrate over the cross-sectional area of the 
wire and replace the volume integral in Eq. (4-3-1) by a line integral along 
the wire. We have then 


Ia’ x (r—-r) 
B(r) = } orp 
around 


wire 


(4-3-2) 


It is very tempting at this point to say that the little bit of wire dl’ 
gives rise to a magnetic field dB(r) given by 


IdY x (r-Yr) 


dB(r) = Ir =, re 


(4-3-3) 
If we then used Eq. (3-5-17) to find the force exerted by dB on a charge q 
at position r and moving with velocity V, we would find 

IqV a’ x@r-r) 


hae ee (4-3-4) 


The problem, as can be plainly seen, is that this force violates Newton’s 
third law. It is not directed along a line joining the current element dl’ 
with the charge q. 

It is not possible for us to completely resolve this dilemma at this 
point in our course. We can however point out one serious problem. We 
are not entitled to write down Eq. (4-3-3) at all, since doing so presupposes 
that we can calculate the effect of a segment of wire as though the remainder 
of the wire did not exist. But if the remainder of the wire did not exist, then 
the current flowing in the segment would cause charge to pile up at its 
two ends and we would no longer be dealing with a magnetostatic problem. 
Now we can in principle break a current loop up in this way, and the charge 
pileups at the ends of the segments would just cancel when they were put 
back together to reform the loop. Hence, if we knew how to deal with the 
physically realistic case of a current segment with charging ends, we would 
be out of the woods. This knowledge will come in our next chapter. 

In any case, let us return to Eq. (4-3-2), which is perfectly correct as 
long as our integration covers the entire current distribution. We will apply 
it to find the magnetic field along the axis of a circular current loop of 
radius R carrying current J. Referring to Fig. 4-3, we place the origin of our 
coordinate system at the center of the loop. We set the x axis normal to the 
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Fig. 4-3 We wish to find the magnetic field along the axis of a circular 
current loop. By symmetry the field can have no y or z components. 


loop so that when we look in the positive x direction the current appears 
to be going clockwise. We see immediately that only the x component of 
the field can exist after we have integrated over the loop, and it will be given 


by 
Tdl : 
B, = § ge sino 


ware 
Re 


= = sin? 0 (4-3-5) 


Making use of this result, we can find the magnetic field along the 
axis of a solenoid having N turns per centimeter with each carrying a current 
I. We choose a point along the axis at which we want to evaluate the field 
and let 0, be the half-angle subtended by one end and 9, be the half-angle 
subtended by the other end (see Fig. 4-4). The segment of the solenoid 
lying between 6 and 6 + d@ has a length 


= dé 


sin? 0 
Hence the contribution to B is given by 


7 2nNI dx 


i sin? 9 = 2xNI/ sin 6 dO 


dB 


Fig. 4-4 A long solenoid of radius R, having N turns per centimeter, each turn carrying 
current /. 


Current flow 
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Integrating from 6, to 0,, we have 
B = 2nNI(cos 6, — cos 6) (4-3-6) 


We see then that the field at the center of a very long solenoid is approxi- 
mately 4xNIJ. The field at one end of the same solenoid is approximately 
2nNI. 

Before we go on, we should get some feeling for magnitudes. The 
unit in which magnetic field is expressed is the gauss. Currents are measured 
in abamperes, and the only conversion factor we need to remember here 
is that 10 amperes = 1 abampere. Now, to take a reasonable example, 
we might choose N to be 10 turns per centimeter and J to be 1 abampere 
(10 amperes). In that case, the field in the middle of a long solenoid would 
be about 120 gauss. The largest fields that can be easily reached with normal 
electromagnets are in the range of 20,000 to 30,000 gauss. Superconducting 
and pulsed magnets can attain fields in the range of 100,000 to 200,000 gauss. 
Fields as high as 10° gauss have only rarely been achieved, in small volumes 
and with pulsed magnets. We shall shortly learn that the energy density of 
magnetic field is given by (1/82) B?, setting a rather clear limit on the field 
magnitudes that are attainable. 

For comparison, we might have a look at electric fields which are 
also measured in gauss. (Remember that electric and magnetic fields are 
measured in the same units.) A useful conversion factor to remember is 
that 1 gauss = 300 volts/em. The largest practical electric fields, in the 
neighborhood of 10° gauss, are thus considerably lower than the largest 
practical magnetic fields. 


4-4 AMPERE’S LAW 


When we are faced with a situation which has a high degree of symmetry, 
it is convenient to make use of Ampere’s law for determining the magnitude 
of the magnetic field. ; 

We begin with the magnetostatic equation 


V x B= 4nj (4-4-1) 


Making use of Stokes’ theorem, we can now evaluate the line integral of 
B around a closed curve: 


pea [ V x B-fidA (4-4-2) 
ic surface 


bounded 
by C 


The normal vector i is chosen to point in the direction given by a right-hand 
rule as applied to C (see page 19). Inserting Eq. (4-4-1) into Eq. (4-4-2) 
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yields 


$B a= 40 j-fidA 
c surface 


bounded 
by C 


= 4n(current passing through surface bounded by C) (4-4-3) 


We can now return to the problem we treated earlier, the uniform 
infinite, cylindrical current distribution (see Fig. 4-1). Once we decide 
that the field must be in the 6 direction (by using the Biot-Savart law, for 
example), we can let C be a circle of radius r about the axis of the cylinder 
and apply Eq. (4-4-3) to determine the magnitude of B. We have 


Bal = 2nrB 


Hence, for r < R, we write 
2nrB = (4n/)(nr7) 
and 
B= 2njr (4-4-4) 
For r = R, we have 
2nrB = (4nj)(xR7) 
and 
2 


B (4-4-5) 


These results are identical with what was found earlier [see Eq. (4-2-5)]. 


4-5 B AS THE GRADIENT OF A POTENTIAL FUNCTION 


Under certain circumstances it is possible to express B as the gradient of a 
potential function g,,. These circumstances require that the current distri- 
bution be in the form of closed loops and that the point at which the field 
is to be evaluated be in a current-free region. We shall derive an expression 
for g,, explicitly for one simple current loop. In the event that there are 
many such loops to consider, the fields we obtain in this manner can be 
summed together. 

Referring to Fig. 4-Sa, we shall assume a current J in the loop C’ 
and place an imaginary surface S’ over the loop with normal vector fi’ 
directed as shown. The vector r’ refers then to a point on the loop and the 
vector r to the position at which we wish to evaluate B. The vector potential 
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Fig. 4-5 (a) Finding the field at r due to 
(a) a current loop. 


at ris 


A(r) = 16 aise (4-5-1) 
c |r—r| 


At this point we make use of a simple mathematical identity. For any 
function f we can write 


} fdl= | (a x VA dA (4-5-2) 
( surface 


bounded 
by C 
Taking f = 1/|r — r’|, we have 
1 
A(r) = | fi’ x V ———_ dA’ (4-5-3) 
x lr—r'| 
Taking the curl of A to find B, we obtain 


dA’ 


waile . 
jr — ¥ 


| Vv x (« x Va) 
a lr — r'| 


But V’(1/|r — r’|) = —V(1/|r — r’|) and hence 


B(r) 


vx | fi x V’ 
S 


s 1 is 
Br) = -1| Vx (0 x V rh dA (4-5-4) 


We next make use of two rather simple vector identities to develop Eq. 
(4-5-4). 
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V x (A x B) = A(V-B) — B(V- A) + (B- V)A — (A- V)B (4-5-5) 
V(A-B) = (A: V)B + (B-V)A + A x (V x B) + Bx (V x A) 


(4-5-6) 
In these identities, the expression (A - V)B means 
0 0 6 
A: V)B = | A, — + A, — + A, — 
( ) alll dy Cs =) 
OB oB OB 
=a + (4-5-7) 


We first apply Eq. (4-5-5) to the integrand of Eq. (4-5-4). Remembering 
that f’ does not depend on r, we obtain 


1 
Vx G x vo) = a(v ra = (vy Vv — 


lr—r |r lr—r 


(4-5-8) 


Since r’ + r (we are interested in the field in a current-free region), we can 
set V7(1/|r — r’|) = 0, leaving us with 


1 
Vv x (w * va] = —(f’: V) V—— 
jr—r o |r— er 
Applying Eq. (4-5-6), we obtain 
! J 
—(f’ : ————| =- ‘.-V¥—— 4-5-9 


Inserting this back into Eq. (4-5-4), we finally obtain an expression for 
B(r): 


1 
Be) = 1V| i in Ty 
e lr —r’| 
a -1¥ | a ae (4-5-10) 
s |r—r 


Now we let dQ’ be the solid angle subtended by dA’ with respect 
to the position r. We will choose dQ’ as positive if fi’ - (r — r’) is positive 
and negative if fi’ - (r — r’) is negative. In that case we can see that 


_&-@-r) 


dA’ (4-5-11) 
lr — r’|? 


AQ’ 


Hence we have proven our point. 
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Bir) = —Vo,, (4-5-12) 
where 
Pm = IQ (4-5-13) 


and ’ is the solid angle subtended by the surface S’ with respect to r. 
Let us make use of this result to recalculate the magnetic field along 
the axis of a circular current loop. Referring back to Fig. 4-3, we note that 
the solid angle is just 
] 


YY = an | sin 6 a6 


= 2nx(1 + cos 6) (4-5-14) 


The gradient of 9’ is in the negative x direction, and hence B is in the 
positive x direction. We have then 


B, = ee. cos 0 
Ox 


= Dill a — 
Oxee |X" eRe 
2nIR? 
ayant dy ase 
(x? + R?)? a) 
Needless to say, this is the same result as we obtained earlier [see Eq. 


(4-3-5)]. 

The general results we have just obtained are particularly useful 
if we wish to find the magnetic field at a long distance from a small flat 
current loop of area A and current J. We have then (see Fig. 4-55) 


Pm = TAR: = (4-5-16) 


fh Fig. 4-5 (cont’d) (b) We find the mag- 

netic field at large distances from a small 

flat current loop of area A carrying current 

I. The origin of our coordinate system is 

I taken at the approximate center of the 
(b) loop. 
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We define the magnetic moment of the loop as 
m = JAfi (4-5-17) 
We then obtain the following simple approximation for @,,. 


Sen. = ay 4-5-18 
Om = M-—s = —m- V— (4-5-18) 
We can compare this with the expression we obtained for the electric poten- 
tial of an electric dipole [see Eq. (2-6-7) ]: 


1 
Pe = 7 ies 


The similarity of these expressions leads us to conclude that the magnetic 
field distribution at large distances from a magnetic dipole is identical to 
the electric field distribution at large distances from an electric dipole. We 
need merely substitute m for p and B for E to obtain the corresponding 
results. 


4-6 MAGNETIZATION (M) AND THE H FIELD 


As we know, matter on the atomic level is made up of relatively stationary 
nuclei surrounded by electrons in various orbits. We shall discuss the 
electromagnetic properties of matter in much more detail in Chap. 10. 
In the meantime, though, we point out that there are three mechanisms 
whereby matter may acquire a macroscopic magnetic-moment distribution. 
(By macroscopic we mean averaged over a large number of atoms. This 
distribution is characterized by the magnetic moment per unit volume M, 
called the magnetization.) The three mechanisms are as follows: 


1. Theelectrons’ orbits or their intrinsic angular momenta may be arranged 
so as to give rise to a net magnetic moment within each atomic system. 
In general, thermal agitation will tend to randomize the directions of 
these moments, and only when an external field is applied so as to 
counteract this randomization do we have any macroscopic magnetiza- 
tion. Materials which behave in this way are called paramagnetic. 

2. We shall soon learn that a changing magnetic field within a conductor 
induces currents which oppose this change. This phenomenon is called 
diamagnetism. The application of a magnetic field to a diamagnetic 
medium will induce currents within the atomic systems, and these in 
turn will lead to a macroscopic magnetic-moment density opposite in 
direction to the applied field. 

3. Iron has two rather remarkable atomic properties that have far-reaching 
consequences with respect to its macroscopic magnetic behavior. First, 
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4 of the 26 electrons in an isolated atom of iron have their intrinsic 
angular momenta lined up. Second, within solid iron, there are very 
strong quantum-mechanical forces tending to make the intrinsic 
angular momenta of neighboring atoms line up. This results in domains 
of macroscopic size having net magnetizations corresponding to about 
two aligned electron moments per atom on the average. Applying a 
magnetic field causes those domains which are aligned in the same direc- 
tion as the field to grow until the iron finally reaches a saturated state 
of magnetization. Typically, saturation occurs at fields in the neighbor- 
hood of 10,000 to 20,000 gauss. Needless to say, removal of the applied 
field does not lead to complete randomization of the domains. The 
residual magnetization can be quite large, as in the case of special 
permanent magnetic alloys, or it can be quite small, as in the case of 
soft iron. These phenomena are characteristic of ferromagnetism. 
Although iron is the best known of the ferromagnetic materials, it is 
not the only one; cobalt and nickel also exhibit the same properties. 


In any case, all three mechanisms we have just described lead to a 
magnetization or magnetic moment per unit volume within materials. We 
now examine in detail how a magnetization distribution M(r’) leads to a 
magnetic field distribution B(r). 

To begin with we must first find an expression for the vector potential 
at position r resulting from a dipole m at position r’. To do so, we replace 
m by a tiny flat current loop and make use of Eq. (4-5-3). We change 
V’(1/|r — r’|) to —V(1/|r — r’|) and remove the integral sign. Remembering 
that (7 dA) = m, we obtain 


1 


A(r) = —-m xX a 


(4-6-1) 


Next we consider the vector potential A(r) arising from the magnetiza- 
tion M(t’). The little volume dV’ has a magnetic moment M(r’) dV’. Hence 
we write ‘ 


l 
A(r) = — M(r’) x V-——— dv’ 
all |r = 
space 
1 
= M(r’) x V —_ dV’ 
all | r 
space 
M ‘9 Ve M ra 
= -{ ile | Vy a) 
all jr — x’ all [r—r'| 
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We next make use of a simple theorem, the proof of which is left to 
the reader, to convert the first of these integrals to more convenient form. 
In general, for any vector function of space W(r), we can write 


| Ve x Windy = | fi x W(r) dA (4-6-3) 
vol V surface 
Applying this theorem and observing that M = 0 at infinity, we note that 
M , £ 
{ vx ped ay = [ fix ewe aon ane) 
fall |r ky | surface |r oak: | 
space at oo 
Thus we have 
Vv’ x M fa 
ae = | ) av’ (4-6-5) 
all |r ak 


space 


Comparing this with our usual equation for A in terms of current 
density, we note that a magnetization distribution M can be replaced entirely 
by an equivalent current distribution j,, where 


in =VxM (4-6-6) 


In general, all current distributions can be considered as consisting 
of two parts—that part which is unassociated with the magnetization and 
which we shall call j, (free current) and that part which can be used to 
replace the magnetization distribution (j,). We write our basic equation 
for B as follows: 


V x B = 4nj = 4aj- + 40(V x M) (4-6-7) 
Rewriting Eq. (4-6-7), we obtain 

V x (B — 42M) = 4nj, (4-6-8) 
We now define a new vector field H at each point in space as follows: 

H = B - 4xM (4-6-9) 


We see then that H obeys the same differential equation with respect to the 
free currents as B obeys with respect to all the currents. That is, 


V x H = 4aj; (4-6-10) 
We can, of course, rewrite Ampere’s law as 
} Hal = ar | jr- fi dA (4-6-11) 
. boweead 


by C 
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Again, we must issue a caveat. The fact that H and j, are related by the 
same differential equation as B and j does not mean that we calculate H 
from j, in the same way as we calculated B from j. There is, for example, 
no equivalent to the Biot-Savart law to make use of in calculating H. Indeed, 
we can have H with no j; at all—witness the case of a permanent bar magnet. 

At this point we must interject a small bit of philosophy. It is customary 
to call B the magnetic induction and H the magnetic field strength. We 
reject this custom inasmuch as B is the truly fundamental field and H is a 
subsidiary artifact. We shall call B the magnetic field and leave the reader 
to deal with H as he pleases. 

Returning to Eq. (4-6-9), we note that in many materials there is a 
proportionality between H and M expressed as M = xH where x is called 
the magnetic susceptibility. We can then write, letting u = 1 + 4zz, 


B = «H (4-6-12) 


The constant p is called the magnetic permeability. 

As we have seen, we can calculate B everywhere by replacing M with 
its equivalent current distribution V x M. However, we often find our- 
selves with a discontinuity in M at the interface between two materials, 
and it is useful to derive an expression for the equivalent surface current 
density at the discontinuity. We define what we mean by surface current 
density in a manner analogous to our definition of volume current density j. 
If a set of charges g; are moving on the surface within an area AA and with 
velocity v;, then the surface current density k is given by 


k= lim —+ (4-6-13) 


We now examine the interface between a region (I) with magnetization 
M, and a region (II) with magnetization M,, (see Fig. 4-6). We place a gauss- 


Fig. 4-6 We find the equivalent surface current at the boun- 
dary between two magnetized media. 
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ian surface of negligible thickness and area AA across the boundary and 
then evaluate k,, AA. 


ky AA = | in V (4-6-14) 
vol 
Remembering the theorem of Eq. (4-6-3), we can write 
ky, AA = | V x Mav 
vol 


Hence we conclude 
ky = ft x (My — M) (4-6-15) 


where the normal fi goes from region I to region II. 

We learned in our last section that it is possible to find a potential 
function ¢g,, such that B = —VQ,,, provided that we could divide the current 
distribution into loops and we stayed clear of the region within which the 
current was flowing. We will now see how a distribution of magnetization 
M leads to such a potential function. Moreover, we will see that if we use H 
rather than B then we will not be excluded from a region where the magnetiza- 
tion is not zero. We assume for the purpose of the present calculation that 
the free current is zero. The fields which the free currents produce can be 
added in separately. 

The basic differential equations when j, = 0 are just 


V xs 0 
V:-B=0 


The first of these equations permits us immediately to define a function 
@y at each point in space such that 


(4-6-16) 


Py(co) = 0 and H = —Voy 
The function is obviously the line integral of H from the point in question 
to infinity. 

@y(P) = | H- dl (4-6-17) 

P 

The second equation can be rewitten as 

V:-B=V-(H + 44M) 

= —V oy + 4nV : M 


= 0 
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Hence we conclude that gy satisfies the Poisson equation 


V-On = —4tpm (4-6-18) 
where 
Pm = -—V°'M (4-6-19) 


We now recall the uniqueness theorems we learned in electrostatics. 
There is one and only one solution to Eq. (4-6-18) that goes to zero at 
infinity. Hence by analogy to electrostatics we can write 


_ Pt ae _ 
Qy(r) = {, = (4 6 20) 


space 


Thus we can replace a distribution of magnetization M by an equivalent 
“charge” distribution equal to —V -M. We can then use this charge dis- 
tribution to calculate H in precisely the same way as a normal charge 
distribution would be used to calculate E. 

In the event that we have a surface of discontinuity between two 
regions of different M, we need to calculate the equivalent surface “‘charge”’ 
density a,,. Referring back to Fig. 4-6, and making use of Gauss’ theorem, 
we find 


Om AA -| Pm dV 
vol 


-| (V-M) dv 


—fi- (My — M) AA 


Thus 
Om = —f- (My, — M) (4-6-21) 


where fi points from region I to region II. 

Before we apply these results, we will note the general boundary 
conditions for the behavior of B and H in crossing the interface between 
two regions of different magnetization. We assume no free current at the 
interface, and hence we have V x H = 0 and V- B= 0. By complete 
analogy with electrostatics (see page 50), we conclude that the normal 
component of B and the tangential component of H are continuous at the 
boundary. 

We will illustrate these methods for finding the fields arising from a 
magnetization distribution by considering two rather simple examples. 

As our first example we take a sphere of radius R and uniform mag- 
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netization M. We would like to find B and H for all points in space, both 
within and outside the sphere. For convenience we set our coordinate system 
up with its origin at the center of the sphere and its z axis pointing along 
M. We let the coordinates r’ and @’ refer to a point within the sphere. The 
coordinates r and @ will refer then to the point at which we wish to determine 
the fields (see Fig. 4-7). 

A cursory glance at our problem indicates that the simplest way to 
proceed would have us replace M by its equivalent charge distribution. 
We would then be able to make use of the techniques we learned in electro- 
statics for determining the potential in the event of cylindrical symmetry by 
means of a multipole expansion. The equivalent charge distribution is all 
on the surface and has magnitude given by 


On = M-fi = Mcos & (4-6-22) 


Fig. 4-7 The magnetic field due to a uniform spherical distribution of 
magnetization M. Outside the sphere, B and H are equal and appear to 
come from a perfect dipole of magnitude $xR*>M. Within the sphere 
B = 8xM/3 and H = —4xM/3 as shown. 


a 
Saar FESS 


2 = 
Be eee =< 
as aa ia 
™= ~ 
Ss \ 
x \ 
\ 
\ \ 
\ \ 
\ \ 
a 
| 
| 
| i 
| | 
| | 
] | 
/ / 
/ / 
, wf 
/ y) 
“a ve 
va “ae 
Pa 7 
Se Ba ee ve 
ee 


160 TIME-INDEPENDENT CURRENT DISTRIBUTIONS; MAGNETOSTATICS 


The “‘potential” @,, at points outside the sphere is given by 


Ont i > — On (0’) R" P,(cos 6’) dA’ Ms (4-6-23) 


n=0 surface 


Remembering that dA’ = 22R? dcos 6”, letting u’ = cos 6’, and making 
use of Eq. (4-6-22), we obtain 

os) 6 Rrt2 1 

our = R) = 2nM ee | 


=0 


PW) PCH’) du’ 
Ai 
Applying the orthogonality condition for the Legendre polynomials, 
we obtain 


cos 8 
r2 


(4-6-24) 


Oy(r = R) = $xR°M 


We conclude then that a sphere of uniform magnetization produces a pure 
dipole field outside itself. The fields outside the sphere are given by 


H(r 2 R) = Be 2 R) = —Vox 2 R) 


OPn 1 (09n \, 
i ( or )r r ( 00 : 
4nR>M a 
= : (2. cos 0¢ + sin 06) (4-6-25) 
: 
Within the sphere, making use of Eqs. (2-14-21) and (2-14-23), we find 
Oy(r S R) = $nMr cos 0 = $nMz (4-6-26) 


Hence the H field is a constant within the sphere: 
H(r Ss R) = —VOy = —$nM (4-6-27) 
The B field inside is also constant and is given by 
Bir < R) = H(r < R) + 42M = aM (4-6-28) 
It is interesting to check the boundary conditions on B and H. At the surface, 
the tangential H fields and the normal B field are continuous, viz., 
H,(in) = (H- 6)6 
= ($nM sin 6)6 
= H,(out) (4-6-29) 
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and 


B,(in) = B(in)-¢ 


8 
= == Moos 6 


= B,(out) (4-6-30) 


We next consider a cylindrical bar magnet with uniform magnetization 
M throughout. We assume M to be parallel to the axis of the cylinder. 
As we have seen, the entire bar magnet can be replaced by a solenoid of 
current with surface current density given by k,, = M x fi[see Eq. (4-6-15)]. 
The magnetic field along the axis of the bar magnet can thus be obtained 
directly from Eq. (4-3-6) if we replace NJ by |k,,|. We have then 


B(point on axis) = 4nM(cos 6, — cos 63) (4-6-31) 


where 6, and 62, as before, are the half-angles subtended by the two ends 
at the point of interest. 

To obtain a qualitative feeling for the directions and magnitudes of 
B and H at other points both within or outside the magnet, we replace it 
with its equivalent charge distribution. Referring to Fig. 4-8, we see that the 
H field is opposite in direction from the B field within the magnet and is 
continuous everywhere except at the ends of the magnet. The B field is 
discontinuous on the circumference of the magnet but is continuous at 
the ends. The fact that B approaches 41M in the center as the magnet 
becomes longer is now apparent because of the weakening of the H field at 
that point. Needless to say, a detailed quantitative analysis of the fields, 
while not difficult, is a bit messy and will not be attempted here. 


Fig. 4-8 A bar magnet with uniform magnetization gives rise to the H 
field shown in (a) and the B field shown in (6). 
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4-7 THE ENERGY OF A STATIC CURRENT DISTRIBUTION; 
FORCE AND TORQUE ON A MAGNETIC DIPOLE 


As we have seen, a static current distribution j leads directly to a vector 
potential, which in turn gives rise to a magnetic field B. Naturally there is 
an energy that is associated with this current distribution or with the resultant 
fields, and, as we anticipate, this energy will be quite analogous to the electro- 
static energy we found for a static charge distribution. 

In evaluating the magnetostatic energy of a current distribution, we 
will ignore the electric field which is needed to keep the current flowing 
and whose energy we can already determine by means of Eq. (2-7-4) or Eq. 
(2-7-6). We consider then that a current density distribution j(r) leads to a 
vector-potential distribution A(r). Were we to multiply our current density 
at each point in space by the numerical factor a, then the vector potential at 
each point in space would be multiplied by the same factor «. Consider 
now that we begin with a given value of a and increase it by dx. Assume also 
that we carry out this increase in a time dt. While we are changing from «A 
to (« + da)A, we are creating an electric field everywhere in space equal to 


1 [d 
i (Fa [see Eq. (3-5-4)]. The work being done on these charges by 


the electric field per unit time is [see Eq. (4-1-5)] 


dw : 1 do 
a - (caj) (-+ an dV 
space 
di 
= -{ a (A-j) dV (4-7-1) 
all t 


space 


Since these forces are acting to prevent us from building up the current, 
the internal energy is being increased as we do build it up. The total mag- 
netostatic internal energy of our final current distribution is thus 


1 
Ur- | ads | j-AdV 
0 all 
space 
1 , 
= — j- Adv (4-7-2) 
2 all 


Space 


The current density above includes everything, magnetization as well as 
free current. Accordingly, we can replace j by (1/42)(V x B). We obtain 
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then 
1 
Uz; = — (V x B)-AdV 
8x Jan 
space 
1 
——— | Vx adv + | (B-V x A)dV 

8x all all 


space space 


all 
at oo space 


1 
= { @x ayaa + | B? dv 
87 urface 


Since B goes to zero at least as fast as 1/r? and A goes to zero at least as 
fast as 1/r, the first integral goes to zero at infinity. We are then left with 


1 2, 
U,; = «|? dV (4-7-3) 


In the event that the magnetization M is proportional to B, we can 
calculate directly the work we need to do to set up a distribution of free 
current jy. The total vector potential A will now be proportional to j,, 
and the energy U, would be given by 


Up = > jp-AaV (4-7-4) 


Up = B-Hdv (4-7-5) 


Returning to Eq. (4-7-2), we can express the total energy of a static 
current distribution in slightly different form. We remember that 


Gi | ie) dv’ 
all 


rr 


space 


Hence 


-+( (es 10) 30) ay av (4-1-6) 


(an 


all 
space 
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It is interesting to consider the situation where our current distributions 
can be broken into N current loops. In that case Eq. (4-7-2) can be rewritten 
as 


N 
Uwe s ie } Ail, (4-7-7) 
i=1 Ci 


where J; is the current in the ith loop C;. Using Stokes’ theorem, we can 
change the line integrals into surface integrals. 


N 
i=1 


N 
= SS yi al B ut f; dA; (4-7-8) 
2 5: 
We define the magnetic flux ®; through the ith loop by the equation 
®; = | B = fi; aA; (4-7-9) 
8; 


Because V : B = 0, the choice of surface over which to carry out this integral 
is arbitrary, as long as the surface is bounded by C;. The energy is thus given 
by 


N 
Us= > ¥ 1, (4-710) 
t=1 


The flux through the ith loop has a contribution to it arising from the 
current in each of the other loops as well as from the ith loop itself. The 
contribution to the flux through the ith loop by the current in the jth loop 
is clearly proportional to that current. Thus we can write 


N 
®= >» Lil; (4-7-11) 
j=1 


The constants of proportionality L;; are called the coefficients of inductance. 
In particular the coefficient L;; is often called the self-inductance of the ith 
loop. 

We notice a remarkable formal resemblance between the equations 
we are developing now and the ones we developed earlier for a set of charged 
conductors. We continue the analogy by proving that L;; = L,;. To find 
the coefficient L;,; we allow a current / to flow in the jth coil and evaluate 
the flux through the ith coil. We then divide the flux by J. 

1 


L;; = | Bs aA; 


1 
-7¢ aa 
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Cj Ir; = r,| 


Hence 


Li; => $ fas Ir — (4-7-12) 


Obviously L;; = L;;. Thus if a current J in the ith loop gives rise to a flux 
® through the jth loop, then the same current J in the jth loop will give rise 
to the flux ® through the ith loop. 

The energy of our system can now be written as 


' But 


1 N N 


ee Lidl, (4-7-13) 


Back in our study of electrostatics we learned that a displacement of a 
set of conductors keeping the potential constant led to a change in energy 
which was equal and opposite to that obtained from the same displacement 
with the charges kept fixed. We now prove the “equivalent” theorem in 
magnetostatics. Let (6U,), be the change in energy of our system of current 
loops if we displace them, keeping the currents constant. Let (6U 3) be the 
change in energy if we displace the loops, keeping the flux constant. Then 
(6Us)o = —(6Us5),. 
The proof proceeds precisely as before. 


] 
(0Us): = > Dy OL hd, 
tJ 
On the other hand, 


(OUs)o = I 6L i]; a Loh], + = >? hol; 
By ij 


1 
tj ty 
a 


1 
i,j tJ 


1 
tJ 


—(dU 3), (4-7-14) 


I} 
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This result will be exceedingly useful in permitting us to evaluate the 
forces and torques on current loops (or magnetic dipoles) by the method of 
virtual work. In this method we imagine carrying out very small displacements 
of the system. For each displacement we can evaluate the amount by which 
the system’s energy will be changed. If we take care to see that the only 
forces (or torques) which act during the displacement are those that we wish 
to evaluate, then we can determine them by equating the work that they 
do to the change in energy of the system. 

Before we can apply this method to our current loops, we must first 
decide what, if anything, to keep constant in our displacement. To do this 
we will have to go a bit ahead of ourselves and make use of a result that 
we will derive in the next chapter, but which we are not altogether unfamiliar 
with. If a coil is displaced in such a way that the flux through it changes, 
then the integral of E - dlaround the loop will not be zero within the reference 
frame of the loop while this change is taking place. (This, we remember, is 
how dynamos work.) Since there is a current in the loop, this electric field 
will do a certain amount of work on the system in addition to the work 
being done by the forces we are trying to evaluate. If, on the other hand, 
we carry out the displacement while keeping the flux through the loop 
constant, then the only forces doing work are the forces of interest and we 
need not concern ourselves with any other work on the system. Hence the 
quantity which is relevant in determining the forces and torques on our 
loops is (6Ug)g. (In the analogous electrostatic situation the relevant energy 
change for determining the forces on conductors was dU, corresponding 
to a displacement in which the charges were kept fixed.) 

We apply this technique to find the torque and the force on a magnetic 
dipole with dipole moment y in an external magnetic field. For convenience 
we will assume that the external field is provided by one current loop C,. 
Needless to say, the result will be generalized to any applied field whatsoever. 
We approximate the dipole by a small current loop C, with area AA, 
and current /, such that w = (J, AA,)f, (see Fig. 4-9). 

Making use of Eq. (4-7-13), we have 


Oi 3L,,1,7 Be $L221,? + Lif, (4-7-15) 


Now, in the displacement we are going to make (either translating or 
rotating the dipole), the self-inductance of C, or C, will not be changed. 
We want to evaluate (Ug), but it is much easier to evaluate (6U,), and 
then take its negative. Hence we obtain 


(U5); = (6L,2)], I, 


1 
= (=| B, -f, aa) I, 1, (4-7-16) 
I, S52 
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u 
C, 
Fig. 4-9 The field produced by 
C, causes a torque and a force to 
act on the dipole represented by 
(Ga C2. 


where B, is the field at C, due to C,. Inasmuch as the loop C, is of infinitesi- 
mal size, we can remove the integral sign with the result that 


(6Ug); = o(B, I, AA, fi.) 
= I, AA, 6(B, - fz) 
= 5(B, - f,) (4-7-17) 


Finally, we write down (6U,)q. 
(6Ug)o = —(6U3); 

= —0(B, - fi) (4-7-18) 
We are now ready to find the torque on the dipole. To do so we make a 
rotational displacement of the dipole in various directions (keeping the flux 
through it constant) and note the change in energy. The drop in energy 
for a given angular displacement 60 is greatest if we turn mw toward By. 
Hence the torque acts to turn w in that direction. The magnitude of the 
torque is given by 


|r|60 = —yd(B,-f.) = +B, sin 6 60 
We have then 
tc=n x B, (4-7-19) 


Next we can evaluate the force on the dipole. We move the dipole 
around, keeping the direction of fi, fixed, and observe the change in energy. 
The force is in the direction in which the energy is decreasing most rapidly. 
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Thus 
F = pV(B, - fi2) 


= (#- VB, (4-7-20) 


Even though we have derived these expressions for an external field resulting 
from one external loop, they are completely general. We can add the external 
field due to any number of loops together; Eqs. (4-7-19) and (4-7-20) will 
still hold. 


4-8 THE MOTION OF A CHARGED PARTICLE 
IN A CONSTANT MAGNETIC FIELD 


Quite a while ago we discovered that a charged particle moving in a magnetic 
field experiences a force on it at right angles to its velocity [see Eq. (3-5-17)]. 
We will now take a closer look at this force and make use of it to calculate 
the orbit of the particle in a variety of interesting cases. 

We begin with the very simplest, a constant field B = Byk, and assume 
that an observer sitting in the laboratory sees the charge as having mass m. 
(Needless to say, m = mo/,/1 — v97/c? where vp is the velocity of the charge.) 
Inasmuch as the field does no work on the charge, its velocity and hence its 
mass remain constant. 

Let r = xi + yj + zk be the position vector of the charged particle. 
Then, applying Eq. (3-5-17), we have 


a’ gB, de®). @Bs (dy. “de. 
dP? me dt ~~ me \de' dt ee 
Separating the equation into component parts, we obtain 
d*x dy 
1. de = Wo dt 
d*y dx 
a? irs ; 8 
dz 
3 dt= = 


where Wo = qBy/mc. The number «, is often called the cyclotron frequency. 
The third equation yields 


z= Vo,t + Zo (4-8-3) 
The second equation can be integrated once and then inserted into the first 
equation: 

d 


y 
a” = + const (4-8-4) 
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With no loss of generality, we can choose the origin of our coordinate 
system so as to set the constant equal to zero. Then 


d*x , 

ya 80 (4-8-5) 
and 

x = Roos (wot + 4) (4-8-6) 


Substituting back into Eq. (4-8-4) and integrating for y, we get 

y = —Rsin (mot + 6) (4-8-7) 
The equations for x and y correspond to a circular path with a positive 
particle moving clockwise as seen from the positive z direction. The path 


in three dimensions is, of course, a helix. The radius of the helix can be 
determined from differentiating x with respect to time: 


a = —Ra@, cos (Wot + 5) 


Hence the maximum value of dx/dt is Rao. If the particle speed is vp, we have 


Doon @ 2 2 
R°@o* + Vo,” = Uo 


or 

jee we a es (4-8-8) 
If vo, = 0, then 

pao — Meo _ Pot (4-8-9) 


Do qBo qBo 


It is useful to put Eq. (4-8-9) into a numerical form corresponding to 
the situation where the charge q is that of an electron and its momentum is 
given in eV/c (see Sec. 3-2). We first observe that ppc = BE where both po 
and E are expressed in standard cgs units [see Eq. (3-2-34)]. This changes 
Eq. (4-8-9) into the form 


R = PE cers (4-8-10) 
qBo 


We next recall that the energy in electron volts of a particle having momen- 
tum expressed in eV/c and mass expressed in eV/c? is just 


Evy) = VPevic) + Mev /c2) (4-8-11) 
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Next we obtain its energy in ergs by multiplying with the conversion factor 
e/299.8 [where 299.8 = 107 8 (velocity of light in cm/sec)]. 


e 
Eggs) = 709.8 V Peeve) a Moevie2) (4-8-12) 


Finally, inserting Eq. (4-8-12) into Eq. (4-8-10) and remembering from Eq. 


(3-2-37) that B./péevje) + Movevic2) = Plevje» We Obtain 


€Pevic) 

R= 4-8-13 
299.8qBo ( ) 
If'g = e, then we have the particularly simple result corresponding to an 
electron or proton (or any other particle with one quantum of charge). 


= Prev /c) 4-8-1 
He 299.8 Bo naale) 


4-9 THE MOTION OF A CHARGED PARTICLE 
IN CROSSED ELECTRIC AND MAGNETIC FIELDS 


The solutions to orbit problems involving both electric and magnetic fields 
are often expedited through the judicious use of the Lorentz transformation. 
We will illustrate this point by examining in some detail the behavior of a 
moving charged particle in the case where both the electric and magnetic 
fields are constant in space and time and at right angles to one another. We 
choose our coordinate system, as shown in Fig. 4-10, so as to have 


B=Bk E= Ej (4-9-1) 
Obviously, to solve the problem completely, we will have to specify both 
the initial position rp and the initial velocity vy of the particle. To simplify 
matters somewhat, we will let vy lie along the x axis; generalizing to arbitrary 
Vo is straightforward and is left to the reader. 


Fig. 4-10 To find the orbit of a particle in crossed electric 
and magnetic fields we transform to a system in which one 
of the fields is zero. 
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Now, it is not difficult to sclve the problem in our given coordinate 
system. The differential equations must be treated carefully, inasmuch as 
mis no longer a constant of the motion as it is when no electric field is acting. 
We write 


By dx), By dy; 


However, rather than solving these equations in a straightforward manner, 
we will use a bit of “trickery” which will serve to simplify the problem 
enormously. 

Let us jump onto a coordinate system L’ which is moving with velocity 
V = Vi along the positive x axis (V is not in general the same as vp, and 
is not to be confused with it). In &’ we have 


~ 
e 


V 
EF’ = 1( Bo =  Bo)i 
V Es 
ny = i( B — bo) 


where 


(4-9-3) 


| 


: /1 — V/e 
Now it is clear that if E, # By then there is some V for which either E’ 
or B’ is zero. To be specific, if Eg < By and V = (E/Bo)c, then E’ = 0. 
If By < Ey and V = (B)/Ep)c, then B’ = 0. In either case the problem 
becomes much simpler in the new system with the indicated choice of V. 
We will solve the problem for the case where Ey < Bo; the alternative case 
we leave to the reader. 

In our new system we have only a magnetic field 


25 ZN 7 
B = (Si = B’k (4-9-4) 


The particle moves in a simple circle, at constant speed and hence with con- 
stant mass. The initial velocity is (making use of the velocity transformation 
equation) 


(4-9-5) 
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The mass of the particle in this system is just 


m = Ee (4-9-6) 


~ JT Gee 


where mp is its rest mass. The frequency of rotation in this system is 


B’ 
game (4-9-7) 
m'c 
The radius of the circle is, of course, 
pe (4-9-8) 


Wo 
and the equation of the orbit is given by 


/ 


x = RF’ sin aol’ 
y = R'cos@ot’ — R 
(Remember that R’ and wo are just numbers calculated according to the 
above equations. We assume that x’ = y’ = Oat?’ = 0.) 

To go back to our original system we just substitute for x’, y’, 
according to the Lorentz transformation: 


x’ = (x — V2) 
yoy 


f= ply ie 
=y a 


This will give us the precise, relativistically correct orbits in the 2 system. 

In the circumstance that vg, just happens to be equal to V, we have 
Vox = 0, and hence the particle is initially stationary in the X’ frame. Since 
there is no electric field in the &’ frame, the particle remains stationary. 
It thus continues to move with velocity vp,i in the © system, experiencing 
no net force. (Of course we expect this inasmuch as gE = —qv/c x B at 
this point.) 


4-10 LARMOR PRECESSION IN A MAGNETIC FIELD 


We have already learned that a magnetic dipole with moment p in an 
external field B has a torque on it given by t = » x B [see Eq. (4-7-19)]. 
Being basically the result of circulating charge, this magnetic moment is 
often accompanied by angular momentum L along the same axis as up. 
In this case, as we shall see, the magnetic-moment vector will precess about 
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the direction of applied field with what is known as the Larmor frequency 
@,. 

We will illustrate this process by considering a classical electron in an 
orbit about a nucleus. Let us begin by turning off the applied field and 
determining the magnetic moment corresponding to the orbit, considering 
it as a current loop. If e is the charge of the electron (e is a negative number) 
then the current is just 


I 


e 
Inu = —~ = — X (number of turns per second) 
c c 


We let A be the area of the orbit. We then recall from Kepler’s law that the 
rate at which area is swept out per unit time is a constant and is equal to 
L/2m where L is the angular momentum of the electron and m is its mass. 
Hence. 

e l 

c period 
2 dA/dt 
cA 


Lae = 


= eL 
~ ImcA 


Remembering that the magnetic moment is equal to the current times the 
area, we conclude that 


eL 
= —— 4-10-1 
YO ( ) 
We have then 
= ak ning LxB 
dt 2mc 


eB 
Pa 8 4-10-2 
OL 2mc \ ) 
we conclude that 
aL 
a o, x L (4-10-3) 


This is the kinematical equation for the precession of the vector L about 
@, with angular frequency @,. 
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[In the case of a “spinning” elementary particle the relationship 
between L and yp is replaced by the more general one: 


eL 

= g>— 4-10-4 

OY ope ( ) 

The proportionality constant g is called the gyromagnetic ratio. For the 
electron, g is very close to 2.] 


4-11 A METHOD OF MEASURING g — 2 


The measurement of the gyromagnetic ratio of the electron or the muon 
has had considerable significance in formulating and testing the modern 
quantum theory of electrodynamics. Both these particles appear to have no 
important interactions with matter except through electromagnetism, and 
hence it is possible to calculate g quite accurately, making use of theories 
that have been developed in the last 25 years. In either case, g turns out 
to be equal to 2 + 6 where 6 is a very small positive correction of the order 
of z45. Needless to say, accurate measurements of g — 2 have been essential 
in determining the validity of the theory. So far there has been no significant 
discrepancy. Asa means of illustrating the power of the relativistic techniques 
we have developed so far, we will describe in detail the first experiment! 
to determine the value of g — 2 for the muon. 

We should begin by saying a few words about the muons themselves. 
Muons do not occur naturally in matter. They are the predominant compo- 
nent of the cosmic radiation at sea level, being the decay products of un- 
stable pions and K mesons produced at high altitude, but they only live 
for about 2 psec on the average in their own reference frame and then decay 
into electrons and neutrinos. Hence, to do experiments with them, one must 
begin by producing pions, preferably at an accelerator, and then allow these 
pions to decay. Now, in 1956 it was discovered? that pion decay does not 
conserve parity. (In simple terms, observing a pion decay in a mirror would 
not lead to the same set of physical laws as observing it directly.) In fact, 
negative muons are emitted with their angular momenta pointing along 
their directions of motion. Positive muons are emitted with their angular 
momenta pointing opposite to their directions of motion. Furthermore, 
if these positive muons were stopped in matter and allowed to decay, the 
direction of this angular momentum (or spin) at the moment of decay 
could be determined by the distribution in directions of the emitted decay 
electrons. 


1G. Charpak et al., Phys. Rev. Letters, 6:128 (1961); Phys. Rev. Letters, 1:16 (1962). 


?R. L. Garwin, L. M. Lederman, and M. Weinrich, Phys. Rev., 105: 1415 (1957). J. L. Friedman 
and V. L. Telegdi, Phys. Rev., 105: 1681 (1957). ~ 
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This result leads quite naturally to a technique for determining g — 2. 
If the polarized muon is introduced into a magnetic field, the direction of its 
angular momentum will precess, as we have just shown. In addition, the 
muon itself will turn through its orbit. After spending a given amount of 
time in the field, the muon can be brought out and the angle between its 
spin and its direction of motion can be measured. As we shall prove, this 
angle is directly proportional to g — 2, and its measurement offers a very 
sensitive and direct test of the validity of our theory of quantum electro- 
dynamics. 

In Fig. 4-11 we illustrate the apparatus used in the g — 2 experiment. 
The muons were introduced into a large magnet, slowed down somewhat, 
and then trapped in orbits for awhile. After a number of turns they left 
the magnet, were stopped within a block of material, and their decay dis- 
tribution was measured, yielding the directions of their final polarizations. 

To follow what is going on, it would be most convenient to move along 
with a coordinate system attached to the muon. Inasmuch as the muon’s 
own coordinate system is not an inertial system, we might imagine our- 
selves within a bit of difficulty here, but we can get out of it by a very simple 


Fig. 4-11 A detailed drawing of the apparatus used for a measurement of g — 2 of the muon 
{G. Charpak et al., Phys. Rev. Letters, 6: 128 (1961); Phys. Rev. Letters, 1:16 (1962)]. A muon 
enters the magnet and is passed through a beryilium block for energy degradation. It then 
passes through a large number of orbits until it emerges and is stopped in a target 7. The 
numbers refer to scintillation counters that are used to detect either the muon or its decay 
electron. 
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ruse. We will follow the muon’s progress with a succession of inertial systems, 
each moving with the muon’s speed, and tangential to its orbit at some point. 
We will then relate the observations within one coordinate system to those 
within the next system down the line until the particle finally leaves the field 
altogether. 

We will make the simplifying assumption that the charged particle is 
moving with speed v at right angles to the field and that the motion all takes 
place within one plane. (Needless to say, this assumption is not precisely 
true in reality and a careful analysis of the orbits is necessary.) Subject 
to this assumption, an observer in each of our moving coordinate systems 
sees a magnetic field equal to yB where B is the laboratory field at that point 
and y = 1/,/1 — v?/c?. 

We begin by examining an infinitesimal portion of the particle’s orbit 
as seen from both the laboratory system X and from the moving system 
x’ which is tangential to the particle’s orbit at this point (see Fig. 4-12). 
In either case the orbit appears to be a segment of arc, but within LD’ it 
appears much foreshortened and hence of smaller radius than in £. The 
Sagitta of the arc is s as seen in X and s’ as seen in X’. The chord of the 
arc is d as seen in © and d’ as seen in X’. Obviously s’ = s and d’ = d/y. 
If d6 is the angle of arc as seen by and dé’ is the angle of arc as seen by 


_ 


Fig. 4-12 A segment of the path of a 
particle in a magnetic field as seen by an 
observer in the laboratory (Z) and by an 
observer with the velocity of the particle and 
tangential to its path (2’). 
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Sthen 
dd 
com ae cael) 


Now what about the time to go from one end of the segment to the 
other? Since 2X’ is the particle’s rest frame at the moment, we can write 


dt = y dt’ (4-11-2) 


Hence the rate of change of direction as seen by &’ can be related to the 
rate of change of direction as seen by 2. 


dy’ do 

dt —s dt 
But d6/dt is just the cyclotron frequency of the particle, eB/mc, where B 
and m are as measured in the laboratory. This leads to 


di y’eB_yeB 
dt’ ymoc moc 


(4-11-3) 


(4-11-4) 


Next we must find through what angle dg’ the spin has precessed 
during the same time interval dt’. The magnetic field as seen by L’ is yB, 
and hence the Larmor frequency is 


dp’ __—s eyB 

aC 2mMoc 
Thus the angle between spin and orbit direction, as seen by &’, has changed 
from one end of the arc to the other by an amount 


(4-11-5) 


é ee “We, 
d(y’ — 0) = @g — 2) ae dt 
eB dt 
= ai = 4-11-6 
( 2) 2MoC ( ) 


We now transform to the moving system which is tangential to the 
next segment of arc (2”). Since the relative velocity of 2” and X’ is infinitesi- 
mal, the angle between spin and orbit at any given point on the orbit does 
not change with this transformation. Thus we have 

(y’ = Ona of = (g” z= O \eeiening of (4-11-7) 

first segment second segment 

Following the particle over the second arc, as seen from &”, we come 
up with the same Eq. (4-11-6) for the change in angle between orbit and 
spin direction. We continue this procedure until we leave the magnet. 
Adding together all the changes in angle between spin and direction of 
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motion, we conclude that 


yw’ = angle between direction of muon spin and direction of motion 
upon leaving the magnet, as seen by observer moving with muon 


eB 
2moc 


G2 t (4-11-8) 
where B is the average field experienced by the muon and f is the time spent 
in the magnet. 

Finally, we must bring the muon to rest. The process of stopping 
the muon is equivalent to applying an electric field opposite to its direction 
of motion. An observer riding along with the muon sees the same electric 
field but no magnetic field at all. Consequently, he does not see any change 
in the muon’s spin direction during the slowing-down process, and the 
final angle, in the laboratory, between the direction of the muon and the 
direction of its spin is given by Eq. (4-11-8). 

The experiment we have just described was the first of a series carried 
out at the CERN laboratory in Geneva during the past 10 years. It is clear 
from an examination of Eq. (4-11-8) that a basic limitation on the accuracy 
of the measurement is the finite lifetime of the muon, about 2.2 psec in 
its own reference frame. To make ¢, the laboratory time, as large as possible, 
one needs to increase the energy of the muon and thus make use of time 
dilation. The most recent experiment! uses muons having a y of 12 and 
hence times of the order of 25 psec. Typical values of y’ are about 2000°. 
To avoid the difficulties of stopping such energetic muons, the experimenters 
actually looked at decays in flight. For a detailed description of the experi- 
ment, the reader should read the article cited in footnote 1. The experimental 
result has come quite close to that predicted by the theory. 


5 
(2*) = 116.616 431 x 10-8 
2 exp 


4 


g—2 e? e 
Le Pe ee ez 
( 2 a 2nhc “iin nh? c? se 


= 116,560 x 1078 


(In the above, e is the charge of the muon and # is Planck’s constant divided 
by 2z. The combination of e?/hc, usually denoted by the symbol «, is dimen- 
sionless and goes by the name of fine-structure constant. Its value is about 
i 
737°) 
We should note that the value of g — 2 for the electron has also been 
determined by this method. A long series of experiments have been carried 


' J. Bailey et al., Phys. Rev. Letters, 28B: 287 (1968). 
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out by Prof. H. R. Crane and collaborators and are described in a recent 
issue of Scientific American.' 


4-12 THE MAGNETIC STRESS TENSOR 


We consider next the methods whereby we can determine the force exerted 
on a distribution of currents by magnetic fields. We will find, in complete 
analogy with electrostatics, that this force can be expressed either in terms 
of a volume integral requiring a knowledge of the current and the field at 
each point in the distribution or in terms of a surface integral in which 
only the field on the surface need be known. We will again find that the 
surface integral will have an integrand which is the product of a stress 
tensor T with the normal to the surface fl. 

We begin by going back to our expression for the force exerted on 
an elementary charge g by a magnetic field B [see Eq. (3-5-17)]. 


=ivxB (4-12-1) 
For a distribution of current over a volume V, this becomes 
F= i x B) dV (4-12-2) 
Remembering that j = (1/4) V x B, we have 
F= = [, (V x B) x BdV (4-12-3) 


To convert this into a surface integral we take the components of F in- 
dividually and transform them. For example, 


B= Ge [i x B x Bay 
4n 


=z [ix (V x B)]-BaV 


1 1 0B? 
ee eee 
iq |, B VE: - > ay 
1 OB,?  OB,B,  0B,B 1 “OB 
= = = —~— — BV: B) - = dV 
4n (iz i oy + az VB) 2 GX 


1H. R. Crane, Scientific American, January, 1968, p. 72. 
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We remember first that V- B = 0. We next recall that the first three terms 
can be turned into a surface integral by following the same steps as we 
followed in proving Gauss’ theorem. We finally obtain, for the component 
F, 


x? 


1 ; om ; 
= — B Ae) VeB aaa 
F TF [@. ROY BeBn, Ray. = i i (B*) 
1 . Bn, 
= — (BY n, + BEB ne ee) — dA (4-12-5) 
4n Ss - 2 


In summary then, for the three components of F we can write 


F= | Tadd (4-12-6) 
s 
where 
2 
B? - > BB BB, 
1 5 ee 
Tae ie BB, B, eas 7 B,B, (4-12-7) 
B? 
BB, B,B, B,? - = 


A comparison with Eqs. (2-16-11) and (2-16-12) shows us a deep under- 
lying similarity between electrostatics and magnetostatics. This is not 
altogether surprising though because, as we have learned, if we had begun 
with magnetic monopoles rather than charges, then the roles of B and E 
would have been reversed. When we use the stress tensor to find the force 
on a volume, we pay no heed to the charges and currents in the volume, 
only to the fields on its surface. Hence the electrostatic stress tensor must 
involve E in precisely the same way as the magnetostatic stress tensor 
involves B. 

We will illustrate the use of the magnetostatic stress tensor by means 
of an example. We have a long cylindrical thin-walled tube of radius R 
carrying current J as shown in Fig. 4-13. The force on the tube acts so as 
to tend to collapse it. To counteract this force we can pressurize the inside 
of the tube with pressure P. We ask then for the value of P needed to precisely 
balance out the magnetostatic force. 

To solve this problem we pass a plane through the axis of the cylinder 
and call it the yz plane. We then calculate the magnetostatic force F exerted 
by the upper half (x = 0) of a unit length of the tube on the corresponding 
lower half (x < 0). We do this by integrating the stress tensor over the yz 
plane, remembering that fi = 1. We have then on the yz plane 


4-12 THE MAGNETIC STRESS TENSOR 181 


Zz 
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Fig. 4-13 We would like to find how much pressure to put on the inside of a tube of 
radius R to precisely counteract the magnetostatic force tending to collapse it. 


RB? 
— 0 0 
2 
l — B? 
ae 0 4-12-8 
e 4n ; Z ( ) 
— RB 
0 
: 2 
Tiesent (4-12-9) 
8x 
Integrating over a unit length, we have 
F= ee B?i dy 
OT. Nice, 


Since B = Ofor —R < y S R, we find 


fo) 2 
= x | (=) i dy 
4n R yi 
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To find P, we set the magnitude of F equal to 2PR, yielding 
P 


=5o (4-12-10) 


We shall come back to the stress tensor shortly when we deal with 
energy and momentum conservation in electromagnetism. At that time we 
will find that the electric and magnetic stress tensors coalesce to form a 
3 x 3 portion of a four-dimensional electromagnetic energy-momentum 
tensor, the other components of which are the energy and momentum density 
contained in the fields. 


PROBLEMS 


4-1. A long straight conductor carries current J. It is in the form of a cylinder of 
radius R with an off-axis cylindrical hole of radius b, as shown. The distance 
between the axis of the cylinder and the axis of the hole is a. Find the magnetic 
field in the hole. 


4-2. Making use of the stress tensor, prove that if the magnetic field is constant at 
all points on a surface surrounding a given object then there is no magnetic 
force acting on the object. 


4-3. A circular toroid with rectangular cross section, as shown, is wound on a core 


Ky 


ao \o 


Center 
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30 


having permeability y. N turns of wire are used, and a steady current / is put 
through the wire. 

(a) Find the fields H, B, and M at radius r within the toroid. 

(b) Find the vector potential A at the center of the toroid. 


An electromagnetic crane is constructed of a U-shaped steel yoke with 1000 
turns of wire carrying current J, as shown. The permeability of the steel is 1000. 
We would like to use it to lift a steel block of dimensions 30cm x 30cm x 120 
cm and having the same permeability. 


/| 


CS 


_| [ 


vo 60 | 30 


All dimensions in centimeters 


Estimate the magnitude of J in order that we just be able to lift the block 
as shown (the density of steel can be taken as 8). 


4-5. A long straight cylindrical solenoid of radius R is wound with N turns of wire 
per centimeter with each turn carrying current /. Find the pressure exerted on 
the inside of the solenoid at its midpoint. 

4-6. It is possible to simulate a portion of the orbit of a charged particle in a magnetic 
field by means of a current-carrying wire under tension. Consider a short segment 
of such a wire under tension T in a magnetic field B. Demonstrate that the 

T 
A 
Z| 
7 Orbit of particle of 
charge q with 
oe ed be q 


momentum p 
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path followed by the wire segment if it carries a current J corresponds to a 

portion of the orbit of a charge g having momentum p. (If g is positive, the 

direction of its motion along the orbit is opposite to that of the current.) 
Derive an equation relating p and q to J and T. 


An electromagnet is constructed with poles, as shown below. The field between 
the poles is in the direction indicated and is assumed to be constant at all points 
except at the boundaries where it drops to zero. A parallel beam of particles, 
each with mass m, velocity v, and charge q, enters the magnet at an angle of 
incidence @ and leaves the exit face with the same angle, having bent through 
2. The fringing field of the magnet will cause the beam to be focused in the 
vertical direction. The magnet length is /. 


Find the approximate distance the beam travels, after leaving the magnet, 
before it comes to a vertical focus. 


A block of conductor having height a and thickness b carries a current J, as 
shown. A magnetic field B is applied to the block, as shown. Assume that the 
current is carried by means of N electrons per cubic centimeter. 

Find the potential difference that is developed between the top and the 
bottom of the block. How can the sign of this potential difference serve to 
determine the sign of the charge carriers? (The phenomenon described here is 
called the Hall effect.) . 


if 
J Constant magnetic 
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4-9. Two circular loops, each of radius R and carrying current J, are placed parallel 
to the yz plane and centered about the x axisatx = —/andx = +1], respectively. 
Each produces a field in the +x direction along its axis. 


/ I ‘4 I 
oe 
N Ne 


(a) Find an expression for the fieid along the x axis. 

(6) Show that the first derivative of B, with respect to x is zero at x = 0. 

(c) For what ratio of / to R is the second derivative of B, with respect to x 
also equal to zero at x = 0? 

(d) Choose R = 50 cm and J = 10 abamperes. Fix / to conform with the con- 
ditions of part (c) above. Now plot B, as a function of x from x = —20cm 
to x = +20 cm on the x axis. (The circular loops considered above are 
called Helmholtz coils. They provide a cheap way of producing a fairly 
uniform but small magnetic field over a large volume.) 


5 
The Variation of the 
Electromagnetic Field 
with Time: Faraday’s Law, 
Displacement Currents, 
the Retarded Potential 


We enter now on what is undoubtedly the most exciting part of our voyage 
through the world of electromagnetism. While electrostatics and magneto- 
statics are interesting subjects, yielding results which are pretty and even 
surprising at times, they give no hint of the incredible beauty and richness of 


phenomenology that lies in store when we allow our currents and charge 
186 
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densities to change with time. We have already gotten a fleeting glimpse 
of the gold mine when we developed Maxwell’s equations and observed 
that an electric field can result from a time-varying vector potential. In this 
chapter we will explore some of the simpler consequences of this observation, 
leaving for our next chapter the crowning prediction—light. 


5-1 FARADAY’S LAW 


We begin by reexamining the important discovery made by Faraday in the 
last century. He observed that it was possible to induce a current in a con- 
ducting loop by changing the flux through it. This effect could be accom- 
plished in one of two ways, by moving the loop itself or by actually varying 
the magnetic field passing through the loop. In the first case, the cause of 
the current flow is the force exerted by the magnetic field on the moving 
charges in the wire. In the second case, the force is the result of a time- 
varying vector potential and its associated electric field. In either case, the 
integrated force per unit charge around the loop is proportional to the rate 
of change of magnetic flux ® through the loop. This result is known as 
Faraday’s law. 

Before proceeding we will define what is meant by electromotive 
force €. We must begin by choosing a direction around the loop C as 
shown in Fig. 5-1 and defining the normals to a surface covering the loop 
by means of the usual right-hand rule. As we said before, the loop is either 
stationary or moving. The force per unit charge on a charge which is fixed 
with respect to a given point on the loop is 


Se ee (5-1-1) 
q c 


where v is the velocity of the given point on the loop, B is the local magnetic 
field, and E is the local electric field. If we integrate this around the loop, 
we obtain the electromotive force 


Be $ (E+ 2x B)-d (5-1-2) 


Fig. 5-1 We show that the 
emf around a curve C is pro- 
portional to the rate of change 
of flux through C. 
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Needless to say, the electromotive force will tend to make a current flow 
around the loop. In any case, the two portions of the integral above corre- 
spond to the two cases we discussed earlier. The first part, 4, E - dl, is the 
result of the time-varying vector potential. The second part is patently the 


result of the loop’s motion. 
Consider then first the emf which results from varying the fields. 


Exoop a is 


stationary 
ae $5 Pde 


tv (ha 


where S is a surface covering the loop. Interchanging the order of differen- 
tiation and bringing the time derivative out of the integral, we obtain 


Broo = =— es a V x A ndA 
, c dt 
stationary 
- - < |B fidA 
1 /d® 
<a 5-1-3 
=(F dt a changing ( ) 


Next we consider the emf which results from a motion of the loop. 


Eaueto = $+ x B- dl 


motion 


But 
vx B-adl 


—(v x dl): B 


—(flux through area swept out by dl per unit time as a 
result of its motion) 


tl 


Integrating around the loop, we again conclude 


ar d® 
Eaue wo at 
motion due to motion 
of loop 


We combine this result with Eq. (5-1-3) to conclude in general that 


1 {/d® 
Fra _ ce (7), sources Ole) 
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We can obtain another expression for & by recalling Eq. (4-7-11). 
We had shown that for a set of N loops 


N 
oF y Lil; 
j=l 


and hence 


e& &, = electromotive force around ith loop 


we 
— c dt 
wo a, 
Sere aa 
F 


This result will be of great importance in the study of ac circuit theory. 

We should offer one rather important comment at this point. Equation 
(5-1-3) relates the emf around a stationary loop to the rate of change of flux 
through the loop. Note though that there need be no magnetic field at all 
at the loop itself. Furthermore, the rate of change of flux is simultaneous 
with the emf. Both these points indicate to us that there is no direct causal 
relationship between changing magnetic field and electromotive force. Both 
are in fact caused by a time-varying vector potential which is the result of a 
time-varying current distribution. Needless to say a change in current 
density at some point must precede a related change in vector potential at 
some other point by enough time to allow for the transit of information 
at the velocity of light. 

To develop some insight into Faraday’s law we will consider several 
examples of its application. The first of these is a situation we are already 
quite familiar with. We have two long coaxial thin-walled conducting tubes 
with radii a and b, respectively (see Fig. 5-2). A current J flows down the 
inner tube and returns on the outer tube. We now allow the current to 


Fig. 5-2 Two coaxial thin-walled cylinders carry a current J as shown. The inner cylinder 
has radius a and the outer cylinder has radius b. We change the current at a rate of dI/dt and 
ask for the electric field distribution as a function of radius. 
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change at a rate of d//dt and ask for the electric field distribution as a func- 
tion of radius. 

The simplest way of handling this problem is by finding the vector 
potential everywhere as a function of the current J. We consider the analo- 
gous electrostatic problem (see Sec. 4-2 for a discussion of this method of 
finding A) where the inner conductor has a charge density per unit length 
of 2 and the outer conductor has a charge density per unit length of —A. 
The potential in this electrostatic problem is 


—24 In formas nib 
o—< 0 forr=b 
—24 In forr <a 


We can now write down the vector potential for the problem at hand by 
replacing A by J and 9 by A,: 


— 20 In forasrsb 
Ag =< 237 In = forr <a (5-1-6) 
0 forr>b 


(2 int) F fore Srsb 


c b} dt 
2 dl 
E(n) = (< In +) = forrsa (5-1-7) 
0 for r > b 


Next let us find E,(r) by making use of Eq. (5-1-4) and evaluating ® 
through the indicated curve C. We first use Ampere’s law to determine 
B as a function of r. 

2nrB = 4nI 

we 


r 


The fiux through C is now determined by integrating B over the indicated 
area. ‘ 
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Oy, a 
@ = Se = — — 2/5 
[2a rin G18) 


To determine & we note that the only contribution to €& comes from the 
electric field E,(r) inside the outer conductor. Hence, for the indicated 
path C, 


€ = E{r) (5-1-9) 


Combining Egs. (5-1-8) and (5-1-9), we reproduce Eq. (5-1-7) for E,(r). 

Incidentally, the self-inductance per unit length of these coaxial 
cylinders can be obtained from Eqs. (5-1-9), (5-1-7), and (5-1-5). We take 
r = ain these equations and find 


L = 2In— (5-1-10) 


As our second example let us cunsider a circular loop of conducting 
material placed in a plane perpendicular to an applied magnetic field By 
(see Fig. 5-3a). The loop has radius J, self-inductance L, and a resistance 
around its circumference equal to R. We now reduce the applied field to 
zero with a linear time dependence: 

Bo 

B(t) = Bo — kt forO'Sits 5 (5-1-11) 
This change in magnetic field gives rise to a current in the loop. We would 
like to determine this current /(t) as a function of time. 

The total flux through the loop at any time is made up of two con- 
tributions. On the one hand, we have the applied flux, which is equal to 
nb? By for t < 0, nb?(By — kt) for 0 < t < Bo/k, and 0 for t = Bo/k. 


CTS By Fig. 5-3 (a) A loop of radius b having induc- 
tance L and resistance R is placed normal to a 


magnetic field as shown. The field is reduced to 
I(t) zero, leading to a current in the loop. [The 
indicated direction of /(t) will be considered the 

(a) positive direction. ] 
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On the other hand, we have the flux which arises from the current in the loop 
itself and which is equal to Li(t). The total flux is thus equal to 


@(t < 0) = nb? By 
a(0 pag 2) = 1bh?(By — kt) + LI(t) (5-1-12) 


o(cz ®) = 110 


Obviously, for ¢ < 0 there is no current flowing. During the time 
interval 0 < t < B,/k, Faraday’s law tell us 


IV 


By 1 d® 
<a | = Se 
e(0 2's k ) c dt 
nb*k ~=6ordLL dd’ 
a ieee (5-1-13) 
Now Ohm’s law tells us (see Sec. 4-1) that 
é 
i) = R (5-1-14) 
Substituting into Eq. (5-1-13), we find 
Lal nb*k 
oa + RI = i (5-1-15) 
Solving for J(t) and inserting the condition that 1(0) = 0, we have 
2 
i= AON: (Po Srer ae’) ford <1 = Bo (5-1-16) 
Re k 


For t 2 B,/k, I(t) satisfies the differential equation 
Bedi 


——+ RJ = 
cuxat = 
Solving for /(¢) and requiring that /(t) be continuous at t = B,/k, we obtain 
mb? k ReBo/Lk Ret/L Bo 
= — = (eReBolLk _ ~ Re eee ‘Als 
I(t) Re (e le fone = k (5-1-17) 


In Fig. 5-35 we have plotted /(t) as a function of time for three choices 
of k. As reasonable parameters we have taken b = 10 cm, R = 3p stat- 
volt/abampere, L = 1000 emu, and By = 1000 gauss. R 

An interesting point which occurs to us as we examine the three 
curves in Fig. 5-35 is that the area appears to remain constant as we vary k. 
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To check this out we can integrate the expressions in Eqs. (5-1-16) and 
(5-1-17). We find that 


en en 
a Re 
| indc= (5-1-18) 
° mb? By 
. R esu 


We can thus conclude that the total amount of charge passing any 
given point in the loop as we reduce the applied field to zero is dependent 
only on the initial flux through the loop and on its resistance. That this 
result is completely general for any loop whatsoever at any orientation 
with respect to the applied field is extremely simple to prove. We observe 
again that the flux through an arbitrary loop can be broken into the applied 
flux D, and the flux due to the loop itself. 


®=0,+ LI © (5-1-19) 


Fig. 5-3 (cont’d) (b) A plot of current versus time for various values of k (R = 35 statvolt/ 
abampere, b = 10cm, L = 1000 emu, B = 1000 gauss). 


I, abamperes 


Time, 107> sec 


(b) 
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Hence in general 


1 d® 
ot eae 
1 dd, Ld 
=o eee Se 5-1-20 
c at c at ( ) 


Integrating with respect to time, we have 


=| dI + R| rar = - =| d®, (S-1-21) 
€ Jo Cc Jo 


0 


feo) 
If we start with no current and end with no current, then | ai = 0. Hence 
we can write Q 


| Ah ®, (initial) — ®,(final) 


5-1-22 
, Re ( ) 


The observations we have just made serve as the basis behind the 
operation of the flip-coil method of magnetic field measurement. By using 
a ballistic galvanometer to integrate current and a coil whose resistance and 
physical dimensions are well known, we can map a magnetic field quite 
accurately. 

Incidentally, we might look to see what happens if the resistance in our 
loop is equal to zero (this happens, of course, in the case of a superconducting 
loop). In that case, using Eq. (5-1-20), we find 


d®,  Ldl 
7 (551223) 


Hence the total flux through the loop remains constant: 
®, + LI = const (S-1-24) 


Before we complete this section, we should go back to the Alvarez 
magnetic monopole experiment that we described earlier (see Sec. 3-6) and 
evaluate its sensitivity. We had at that time developed an expression for the 
emf around a loop linking the path of the moon rock if the latter contained 
a monopole charge Q'” [see Eq. (3-6-14)]. We can derive the same expression 
by noting that the total magnetic flux issuing from the monopole is just 
4x0. Hence each time Q passes through the loop, it changes the flux 
through it by an amount equal to 4xQ”. If the time for one pass through 
the loop is equal to T, then 


1 Ad 49 


&=- _ 
@ AN Ge 


(S105) 
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We now observe that if the loop'contains n turns of superconducting wire 
with inductance L, and the sample passes through the loop N times, then 
the change in current A/ can be obtained by integrating Eq. (5-1-23), leading 
to the result 


= 4nNno” 


Al Z 


(5-1-26) 

~~ In the actual experiment the coil had 1200 turns of wire and a self- 
inductance of 78 x 10° emu. The detectors used were sensitive to a current 
change of 1.4 x 10~!° abampere. The magnitude of the basic Dirac mono- 
pole charge is about 68.5 times larger than the electron’s charge in esu, 
and hence for N = 400 we would expect AJ = 25 x 107!° abampere. 
The experiment was thus sufficiently sensitive to easily detect one Dirac 
monopole. Twenty-eight lunar samples were examined with a total mass 
of 8.4 kg. No monopoles were found.! 


5-2. THE CONSERVATION OF ENERGY; THE POYNTING VECTOR 


To derive the energy conservation laws from Maxwell’s equations is remark- 
ably simple. We take the scalar product of one equation with E and of 
another with B, viz., 


1_ 6E 
E-(V x B)=—E-—" + 4nj-E 


1 
B:(V x E) = = Bae 


Subtracting the lower equation from the upper and using a simple vector 
identity, we have 


1 oa 


Multiplying by c/4x and rewriting, we have 


c Q (E* + B? 
ie Jee | lanai j-E=0 5-2-1 
v-(Zexs)+ o( Bn Jeo ( ) 
We define the vector S, called the Poynting vector, by 
S= rx 8 (52-2) 
4n 


‘For a more complete discussion see L. Alvarez, P. Eberhard, R. Ross, and R. Watt, Science, 
167: 701 (1970). 
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Integrating over any volume, we have 


= yA B? 
| sada += | (At )ar + | ane =o 
surface ot vol 8x vol 


of vol (5-253) 


Let us see if we can identify the various terms in this equation. The 
last term we have come across earlier in Eq. (4-1-5). We recall that cj- E 
is just the rate per unit volume per unit time at which mechanical work is 
being done on the charges by means of the electric field. Hence we have 


| cj: E dV = rate of increase of mechanical energy of charges 
vol 


We also recall that (E? + B?)/8x is the energy density of the electric and 
magnetic fields. The second term in Eq. (5-2-3) is thus the rate at which 
field energy within the volume is increasing. Finally, the first term in Eq. 
(5-2-3) has the appearance of an outgoing flux and must be equal to the 
rate at which energy is leaving the volume per unit time. So everything just 
adds up right if we interpret S as being a vector which points along the 
direction in which energy is flowing and whose magnitude is equal to the 
flux of energy per unit time through a unit area normal to itself. 

As we shall shortly see, Eq. (5-2-3) will be only one of the four energy- 
momentum conservation equations. We also anticipate that energy con- 
servation and momentum conservation can be simply represented by just 
one single four-dimensional equation among proper four-dimensional 
quantities. Thus we shall succeed once again in unifying electromagnetic 
field equations by means of relativity. In the meantime though, we demon- 
strate a very simple application of the above by returning to our old standby, 
the long current-carrying wire (see Fig. 5-4). No fields are changing, and so 


0 i age 3 


| = 


Fig. 5-4 We apply the general energy conservation equation to a segment of current-carrying 
wire. 
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We draw an imaginary surface of radius r and length / about the wire, as 
shown. The wire segment enclosed is assumed to have a resistance R. 
To produce a current J, we need an electric field parallel to the wire of 
magnitude /R//. The magnetic field at r circulates about the wire and has a 
magnitude equal to 2//r. Hence 


is| = c 2IR soy, 
. ~ 4x Ir - 
and 
2I7R 
| S-fidd = — -— 7 2nrl = —P Re 
surface 


We see then that the amount of energy entering the volume per unit time 
is just equal to the ohmic dissipation within the wire. 


5-3 MOMENTUM CONSERVATION IN ELECTROMAGNETISM 


It would be quite straightforward to derive the momentum conservation 
relations by simple manipulations of Maxwell’s equations. We choose, 
however, to use another approach, illustrating again the enormous power 
inherent in treating the electromagnetic field relativistically. We shall 
rewrite the energy conservation equation we have just obtained in terms of 
our four-dimensional representation. We will then change all subscript 
4s to Is, 2s, and 3s in turn, yielding the three momentum conservation 
equations. Simple inspection of these equations will show that they make 
sense and do indeed lead to the proper results when applied. 

We begin by rewriting the energy conservation equation (5-2-1) in 
slightly different form. 


-iS 10 (ASA 
: + a 


4 
ar ) ik = ~ 2 Jats (5-3-1) 


The expression on the right is the fourth component of a four-vector. 


—iS E* + B? 
aaa aieal are four components of a sec- 


Hence we anticipate that ( 
ond-rank tensor 7,,,. We have 


2S ~iS, = aya? 
= Ty, = oe 
™ 


(5-3-2) 
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Let us see if we can write these components in terms of the field tensors 
F and G: 


iS, —i(E x B), 
1 
= An (BrE, = B,E,) (5-3-3) 


Now there are two ways in which we can write 7,,. Referring back to Eqs. 
(3-5-6) and (3-6-9), we find (since F,; = Fy, = 0) 


1 
Ti, = Gq (fi2Faa + F,3F3,) 


4 
Aik (5-3-4) 
Alternatively we could write 


| 
Ty, = Gq (0 12G28 + G,3G34) 


4 
y oT Gus (5-3-5) 


Naturally, we expect that the prettiest combination of the above equations, 
namely, the one which is most symmetrical with respect to E and B, will 
match for all the components of Eq. (5-3-2): 


1 4 
fee 8 z (Fin Fus et G1,G,4) (5-3-6) 
Th ey 


We can check out the other components against Eq. (5-3-2): 


4 


» Gorka ats G,G,s) 


1 
Pe 
24 = BF rad 


= 
= Oa 
8x ( Zz x 2EFB:) 
= 
= x By, 
1 


4 
T34 = on y (Fae Gs aa) 
u=1 


—i 
= a, & x B), 
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1 


4 
144 = ‘8x , 2 ye “i Gay, G4) 


a 2 2 
; ees sta PEP > BY? + BY BR?) 


_ EP + B 


Be 8x 


So it seems we have found the tensor we are looking for: 


Sao - 3 (F, pu ie Go, Ge) (5-3-7) 
Because both F and G are antisymmetric, T is symmetric: 
doe = Top (5-3-8) 
The energy conservation equation can now be written as 
T, Eee 
mee (5-3-9) 
p=1 
If we replace 4 by 1,2, and 3 we should obtain the momentum conservation 
equations 
4 OT, 4 4 : : 
5 a ee fomoe—ele2, 4 (5-3-10) 
p-1 p p=1i 


We now rewrite this result in terms of E, B, and our charges and currents 
and see its physical significance. Let o = 1, for example: 


OD eeT.,, MOE; POSS ce. 
ay apis ee 
(G+ Gee “ (GB) — j:B,) + pE, 


= (jj x B), + pE, (5-3-11) 
6T,, | OT, | OTs, = é Sx 
{( Ox us oy i oz ws y ot oe 


a | [G x oe + pE,|dV (5-3-12) 


Using Gauss’ theorem, we can change the volume integral on the left into 
a surface integral. This yields 


o/s 
| (Tin, + Thin, + Ty,n,) dA = | =(3) dv 
Ss y Ot c 


¢ | [G x B), + pEJaV (5-313) 
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The integrand of the left side of Eq. (5-3-13) looks like the first compo- 
nent of the product of a three-dimensional tensor T and the normal vector 
fi. In fact, this portion of the four-dimensional tensor T is nothing more 
than the sum of the electric and magnetic stress tensors we have studied 
earlier (see Secs. 2-16 and 4-12). 


1 iE? B? 
Th 1 a ag (2 + B? as oe 5) 


4n 2 2 

1 iE B? 
Tas = ae (EP + BY a ae 

1 97 ee ea? ee 
i ae taal 


(5-3-14) 
1 
Tz = Tr = G- (EE, + B,B,) 


1 
Es Ts, ie An (Eee, tt B,B,) 


oa 
w 
| 


l 
Th = T32 = q, hE: a B,B.) 


We generalize Eq. (5-3-13) then by writing down the complete momen- 
tum conservation equation 


{ taaa -{ 5 (S)ar+ | (j x B + pE) dV (5-3-15) 
s y Ot \c Vv 


The surface integral on the left can be thought of as the total momentum 
flowing into our volume through the surface per unit time. (Alternatively 
one might think of it as being the electromagnetic “force” exerted on our 
volume by the outside world.) The first integral on the right should be just 
equal to the rate of change of the field momentum within the volume. 
This would indicate that S/c? is the momentum density of the electro- 
magnetic field. Finally the second term on the right is nothing but the 
force on the charges and currents within the volume and is thus equal to 
the rate of change of mechanical momentum within the volume. We have 
thus derived the equation for momentum conservation in the presence of 
electromagnetic fields. 


5-4 ELECTROMAGNETIC MASS 


As a charged particle moves along through space with velocity v, the electro- 
magnetic field it carries along has a momentum which depends upon its 
velocity. Inasmuch as we have learned that S/c? is the momentum density 
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of the field, we are now in a position to find the total field momentum for 
our moving particle. We shall see that it is proportional to both v and 
1/./1 — v*/c?, in precisely the same manner as the mechanical momentum. 
In the case of mechanical momentum, the constant of proportionality is 
called the mechanical rest mass. Similarly, we will call the constant of pro- 
portionality in evaluating the field momentum the electromagnetic rest mass. 
We shall then see that this is not exactly the same as the electromagnetic 
~-s1fass obtained by dividing the total field energy in the particle’s rest system 
by c?. This would be troublesome except for the obvious fact that we have 
left out another force entirely, the force which holds the charge together. In 
any case, there is no way of distinguishing mechanical mass from electro- 
magnetic mass by applying forces to the system. 

To simplify our considerations, we will make use of a specific example— 
a spherically symmetrical charge distribution of radius R moving with 
velocity v along the positive x axis (see Fig. 5-5). The direction of S is as 
shown in Fig. 5-5. Clearly, only the x component of S/c? (the momentum 
density) is not averaged out as we carry out an integration over all space. 
So our job consists of evaluating S,/c? everywhere and then integrating 
over all space to find p, (electromagnetic). 

The simplest way to proceed is to go over to the rest system of the 
particle (2’), calculate T’ in that system, transform T’ back to the labora- 
tory system with velocity —v, examine 7,, = —iS,/c, and then integrate 
over space. We first write T’. 


tM 
Ee Ee ELE, 0 
: E’2 
ELE — E,E, 0 
—— (5-4-1) 
4n EF? 
ELE! EEO ED — = 0 
E? 
0 0 0 a 


where E’ depends only on r’. 
To transform to the laboratory, we multiply T’ on the left with L and 
on the right with L (the transpose of L) where 


y 0 0 —ify 

0 1 0° O 
L= 

001 0 

ipy 0 0 ¥ 
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B out E 


Fig. 5-5 We are interested in evaluating the electromagnetic momentum associated with a 
moving charge. 


Carrying out the multiplication yields 


*Qn,2 
ipy , , 
ay (5-4-2) 
and hence 
2 TRE 
Sa mit E” sin? 6’ :’ where cos 6’ = — (5-4-3) 
4n E 


To evaluate the total field momentum we integrate over all space. 


1 Bye en 
— i “av -4-4 
p,(em) ae {, (E’)* sin* 0’ d (5-4-4) 
space 


But this is an integral much easier to evaluate in the L’ system. We remember 
that because of the Lorentz contraction, ydV = dV’. Hence 


z = an | (E’)?r’? dr’ | sin? 6’ dé’ 
0 


0) 


By | * (B)2r? dr’ (5-4-5) 
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But the total electrostatic energy in the particle rest system is given by 
] le) 
OE = =| (Ber ar’ (5-4-6) 
0 


Hence we can write 


p,(em) = = ae (5-4-7) 
c 

We would have expected, in the absence of any other forces, to have 
obtained the relationship p,(em) = (fy/c) Up. 

Unfortunately, though, we have left out some factors which must play 
an important role in this problem. There has to be some additional force 
around to hold the charge together! Having omitted this force, which would 
presumably lead to a change in both the internal energy of the system and 
its momentum, we cannot expect to get a completely consistent theory. 
In any case, we have come up with the kind of velocity dependence we were 
looking for, and we are ‘“‘almost’’ right in the relationship between rest 
energy and momentum. 

Needless to say, there is no way in which one can experimentally 
separate the portions of the particle’s self-energy which contribute to 
its mass. When we act on a charged particle with an external force of any 
sort, we act on the entire mass regardless of its origin. 


5-5 THE DISPLACEMENT CURRENT 


It is interesting to reexamine Ampere’s law, now that we are permitting the 
charges and currents to be time dependent. We recall that Ampere’s law 
was derived from one of Maxwell’s equations, the one which related the 
curl of B to the current density. Reinserting the time-dependent terms which 
we dropped when we were studying magnetostatics, we have 


V x B= 4zj + — — (5-5-1) 
c Ot 


We observe that the effect of CE/ét is equivalent to that of an additional 
current density jp given by 
leer 

12% 4nc Ot 


This term is given the name of displacement current density. 
Ampere’s law can now be rewritten in the following form: 


(5-5-2) 


¢ B- dl = 4n(J + Ip) (5-5-3) 
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where 


Ip 


total displacement current through surface bounded by C 


| ip: dA (5-5-4) 
surface 


Needless to say, the total current given by j + jp must be divergence- 
less, or else the sum of J + Ip would depend upon which surface we chose 
to cover our curve C. Let us check this out. 

| goggle 


a i ai il il = 


= 0 (conservation of charge) (5-5-5) 


To observe the application of this new formulation of Ampere’s law, 
we make use of the example of a parallel-plate capacitor which is charging 
at a constant rate. We allow the plates to have area A, as shown in Fig. 5-6. 

We wish to evaluate the field along the indicated curve C. Now we have 
two choices for the surface over which we wish to do the current integration. 
We can choose a surface S, through the wire, in which case we will get 


} B- dl = 4nI (5-5-6) 
c 


Fig. 5-6 We apply the modi- 
fied Ampere’s law to find the 
field around a wire charging a 
capacitor. 
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Alternatively, we can choose a surface S, through the capacitor. In this 
case we will get 


1 { dE 
B- dl = — | — : 5. 
? ere fi dA (5-5-7) 
But E- ft = 42Q/A and hence 
Se cE. 4 dQ Ax 
ee tae mr ies a 
This leads us again to 
1 4 
f B- dl = — — lA = nl (5-5-9) 


Thus, at least in this simple-minded case, the application of the modified 
Ampere’s law leads to the anticipated result. 


5-6 THE FOUR-VECTOR POTENTIAL AND HOW IT IS MODIFIED NOW 
THAT CURRENTS AND CHARGES ARE CHANGING WITH TIME 


When we were deriving Maxwell’s equations in Chap. 3, we found that the 
basic differential equations for A and g had the form [see Eqs. (3-4-9) to 
(3-4-12) ] 


Vv? ee A(r,t) = —4nj(r,1) 
eo Or a) TEE, 
(5-6-1) 
Cc Or 
We also wrote down the solutions in the event that there was no time 
dependence [see Eqs. (3-4-13) and (3-4-14)]: 


A(r) = | Ae) av’ 
all 


Fr 


(v spall Joe. a 


space 


: (5-6-2) 
o(t) = | =a 


r-r 


space 


We will introduce time dependence by a rather intuitive guess and then 
go on to demonstrate that our guess does indeed satisfy Eq. (5-6-1). 

Let us assume that a given current (or charge) at r’ contributes to 
the potential at r in the same manner as when there was no time dependence 
but that the information travels from r’ to r with a velocity c. Thus the 
bit of current at r’ at time ¢ — |r — r’|/c gives rise to a bit of potential at 
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position r and time ¢. Adding all contributions from all r’ together, we have 
now 


oad 
space 
|r —_ r’ (5-6-3) 
eee 
r,t) = - dv’ 
g(r,t) {, a 


If these “solutions,” which appear quite plausible on the surface, really 
do satisfy the appropriate differential equations, we will have achieved 
our goal. After this we can find all electric and magnetic fields as a function 
of space and time by means of the relations given in Eqs. (3-5-4) and (3-5-5). 


1 odAC,t 
B(e,) = -Volr,) — — = 


Bir, = V x AC, 


Our first chore then is to demonstrate that (r,t) and A(r,t) as we 
have written them are solutions of our differential equations. We tackle 
only g(r,?) explicitly. That A(r,/) satisfies our equation follows by complete 
analogy. Let us take u = t — |r — r’|/c. Now we will have need of the 
various derivatives of p: 


es 5 Oe 
ax du Ox 
Op _ Op bu — op 
a On On yielding Vp = Oa Vu (5-6-4) 
Op Op ou 
Oz Ou Oz 
op 0p 
Vip = Vee li 5 
p=V ay Vu + a V*u 
0’p Of, 
= aye VY Vu + age u (5-6-5) 
Also 
on 6? 
aoe 2 (5-6-6) 
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Now 
[a co) 
c clr —r 
1 | V-(r-r’) 
V2 ee ee ( 
Al = ae 71 
—— 1 3 
c lr — r'| lr — r’| 
—2 
clr — r'| 
ee 
2... — , 
FP lr — r’ 
+ 2| vote 9 oy For dV’ 


] 
ar IG era p(r’,u) dv’ 


But we have already learned how to treat the third integral. 


1 
Vv? =e = 0 except whenr = r’ 
_ 


We also know that 


1 
| v- =F] dV’ = —4n __ (see page 32) 
all 


Thus we can write 
1 1 
Vv? -_——.. p(r’,u) dV = p(r,d) | V? ——- dV’ 
jr — r'| lr —r'| 
= —4np(r,t) 


As far as the other integrals are concerned, we have 


yi & (r’,u) 
Par ame dp 


s)he 


lr — rr’ 
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(5-6-7) 


(5-6-8) 


(5-6-9) 


(5-6-10) 


(5-6-11) 
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and 

op 1 
= nso see 5-6-12 
Zi du |r —r'|? ( ) 


Hence the sum of the other integrals is zero, and we are left with 


2Vp(r’'u): V - 


lr—r 


v2 7 = —4np(r,t) 
ms rr g(r,t) = —4np(r, 
Our proof is complete. 

In our next chapter we will develop an intuitively appealing technique 
for evaluating this integral which is particularly useful in dealing with the 
field of a moving, small charge. We will then observe how the acceleration 
of a charge gives rise to a remarkable new phenomena, radiation. 


PROBLEMS 


5-1. Show that A and @ as given by Eq. (5-6-3) satisfy the so-called Lorentz con- 
dition, namely, 
V‘A+ hi we = 0 

iG wor 

5-2. A circular loop of wire having radius 5, resistance R, and self-inductance L 
is set with its plane perpendicular to a time-varying magnetic field B(t) = By 
COs wt. 
(a) Develop and solve a differential equation describing the current through 

the loop as a function of time. 

(b) How much energy is dissipated in the loop per unit time? 


5-3. A resonant circuit is constructed by putting a capacitor and an inductor in 
series. The physical dimensions of both the capacitor and the inductor are 
completely known, and no dielectrics or magnetic materials are present in the 


system. The circuit is now observed to resonate at a frequency w. Show how 
the velocity of light c can be determined entirely in terms of the physical dimen- 
sions of the system and the resonant frequency w. 


5-4. A transmission line is made up of two long parallel perfect conductors of 
arbitrary cross section. Current flows down one conductor and returns on the 
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5-5. 


5-6. 


other. The conductors are surrounded by vacuum. Show that the inductance 
per unit length L and the capacitance per unit length C are related by the equation 
1s (Ca—ale 


{Hint: Make use of the Lorentz transformation to move with some 
velocity v parallel to the conductors. Remember also that L is measured in 
emu and C is measured in esu. If L is measured in esu then LC... = 1/c?.] 


Find the self-inductance per unit length of two coaxial thin-walled tubes, the 
inner one having radius a and the outer one having radius b. 


Two parallel wires, each having radius a, are separated by a distance b. A 
current J goes down one and returns on the other. It is spread uniformly over 
the cross section of each wire. Find the inductance per unit length of the pair 
of wires. 


An electron travels in a circular orbit of radius R about a fixed proton. A 
magnetic field By is now applied in the same direction as the electron’s angular 
momentum. Find the change in the orbital magnetic moment of the electron 
as a result of the application of this magnetic field. 


A charge q is moving in a circular orbit of radius R about the center of a cylin- 
drically symmetrical magnet, as shown. Assume that the orbit of the charge 
lies in the median plane between the poles of the magnet and hence the only 
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component of magnetic field that it sees is in the z direction. The field is now 
allowed to increase with time. Show that the particle will accelerate without 
any change in radius if the increase in the average field for r < R is twice the 
increase in the field at r = R. That is, 


dB(R) _ , aB(R) 
dt «dt 


where 


= 2h 
B(R) = z| B(r)r dr 
R® Jo 
5-9. A long, uniformly charged nonconducting cylinder of radius a carries a charge 
per unit length of A. It is wound with N turns per centimeter of wire carrying a 
current J. This current gives rise to a magnetic field which we will consider as 
uniform throughout the cylinder. 
(a) Find the Poynting vector as a function of distance from the axis of the 
cylinder. In which direction does it point? 
(6) The momentum density of the electromagnetic field is given by S/c?. Find 
the angular momentum per unit length of the electromagnetic field about 
the axis of the cylinder. 


(c) The current J is now turned off. This gives rise to an induced electric field 
(by Faraday’s law) which exerts a torque on the cylinder. Find this torque 
per unit length in terms of dJ/dt. 

(d) Integrate the result of part (c) to obtain the total change in the angular 
momentum per unit length of the cylinder. Compare this result with the 
answer to part (5). 


5-10. Consider a charge e and a magnetic monopole g situated a distance S apart. 
(a) Calculate the total angular momentum of the electromagnetic field about 
an axis though g and e and show that it is totally independent of S. 
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(b) Set e equal to the magnitude of the electron charge and set the angular 
momentum of the field equal to the intrinsic angular momentum of the 
electron (h/4z). How large is g? 


6 
Let There Be Light! 


We now approach the high point in our study of electromagnetic theory. 
Although we have discovered many remarkable effects, none compares in 
importance with those in the area we are about to explore. We shall find 
that if we accelerate a charge it will emit radiation. Specifically, it will 
cause electric and magnetic fields to appear which decrease inversely as 
the distance from the charge to the first power. This is totally different in 
quality from what we have been used to, namely, fields which decrease 
inversely as the second power of the distance from their source. Furthermore, 
the new fields will be transverse fields. That is to say, their directions will be 
transverse to a line joining them to the accelerating charges. In addition 
these fields will appear to be moving with velocity c away from the accelerat- 
ing charge. If we were not so familiar with radio waves and the electro- 
magnetic nature of light, this would be an unbelievable discovery. We 
212 
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would head right out, as Hertz did, and try to produce these long-range 
electromagnetic disturbances, discovering radio communication in very 
short order. But, although we are somewhat jaded from long exposure to 
these phenomena, it will still be exciting to really understand their origins. 
This we now proceed to do. 


“6-4 A NEW WAY OF CALCULATING RETARDED POTENTIALS 
IN AN INTUITIVELY APPEALING MANNER 


As we remember from our last chapter, one way of solving an electro- 
dynamics problem with time-varying currents is to calculate the retarded 
four-vector potential. For example, we would find g(r,/) by evaluating the 
integral 


(ei) 
pir,t— 
ote. = | mselaedy” 


a fr—r 


To evaluate this integral we must add together the contributions from charges 
at various distances from our position r, always dividing by the distance 
|r — r’| and always evaluating the charge at an earlier time = t — 
jr — r’|/c. 

To evaluate this integral can be quite a labor. We will develop here 
an intuitively appealing method of performing the integration in physical 
terms and then apply it to find the potentials due to a moving point charge. 

Let us say that we are interested in ¢ at position r and time ¢. Suppose 
at the instant ¢ we started our clock going backward in time, with all the 
moving charges retracing their paths exactly. At exactly the same instant f, 
let us send a spherical “‘information-gathering” pulse out with velocity c, 
also moving backward in time (see Fig. 6-1). As the pulse reaches a given 
bit of charge at position r’, it observes the charge as it was at time ¢ — 
|r — r’|/c. But this is just what we want for our integral, so we count the 
charge, divide by |r — r’|, and go on. 


Fig. 6-1 A spherical shell expands with 
velocity c as our clock runs backward in 
time. As we encounter some charge, within 
the thickness d, we add it up and divide by 
|r — r’|, the radius of the shell. 


214 LET THERE BE LIGHT! 


To see how we proceed more clearly, it is perhaps better to go through 
our motions in a series of discrete steps and add rather than integrate. We 
let our pulse be the spherical shell shown in Fig. 6-1, with a shell thickness 
d. Every time our negative-running clock changes by d/c sec, we advance 
the shell by a distance d (thus the average shell velocity is c). 

Now as we come upon some charge at a distance |r — r’| from our 
starting point, it appears as it was a time |r — r’|/c ago. While our shell is 
sitting there, we add together all the charge within its thickness d and divide 
by |r — r’| (the radius of the shell). We put this number [it is just (p dV)/ 
|r — r’|] into our adding machine and then sit back and relax until the shell 
moves on to its next position. Of course, as the shell moves ahead one 
position, the charges shift too, to the values they had at time ¢ — |r — r’|/c 
— d/c. We repeat our calculation, add the result into our adding machine, 
and again wait for the shell to proceed. 

Now isn’t this sum just exactly what we want? We are just adding 
together all the p dV at the appropriate earlier time and dividing by the 
distance |r — r’|. We use discrete steps, but that is no problem inasmuch 
as the thickness d can be made arbitrarily small, leading ultimately to an 
integral rather than a sum. It would seem then that we have found a new 
technique for evaluating both g(r,f) and A(r,‘)—a technique which, as 
we shall see, is especially useful in dealing with moving discrete charges. 


6-2 THE POTENTIALS OF A SMALL MOVING CHARGE 
(LIENARD-WIECHERT POTENTIALS) 


We wish to find the potentials at position r and time ¢ due to a small moving 
charge at position r’. By small we mean that we can neglect the variation of 
1/|r — r’| as the imaginary spherical shell we have just described passes 
over our charge. Now we will carry forth our arguments, assuming the 
charge volume to be rectangular in shape and uniform in charge density. 
(Of course, any other charge distribution can be subdivided into such 
elements and their separate contributions can be added together.) If the 
original velocity of the charge at time ¢’ was v‘(t’), we now give it a velocity 
—v‘(t’) for the purpose of doing our sum. As can be seen from Fig. 6-2, if 
—v'(t’) is in the direction away from r, then our expanding sphere will spend 


Fig. 6-2 The expanding sphere 
travels a distance L while still 
overlapping the charge. 
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more time overlapping the charge, and hence the contribution to the poten- 
tials will be larger than if the charge were stationary. Conversely, if the 
velocity —v‘(t’) will now be toward r, the sphere will spend /ess time over- 
lapping the charge, and the contribution to the potential will be smaller 
than if the charge were stationary. Referring again to Fig. 6-2, we see that 
our spherical shell travels for a total distance L while still overlapping the 
_ charge. Hence the potential we obtain is just 
ae 
q L 
lr—r(t’)| 1 
where / is the length the charge appears to have. But 
L-! will 
v-(r—ry/fr—-r| cc 
Hence 
GENS « — rt 7 
rae [r a0) wee 


clr — r’ 


p(r,t) = (6-2-1) 


(6-2-2) 


This leads us to the result 


q 


er [ 7 ie 


c 


g(r,t) = (6-2-3) 


where é’ is a unit vector from r’‘(¢’) to r. 
r—r(t’) 


aan jr — r'(¢’) 


(6-2-4) 


Similarly we have for A(r, 4), 


qv (t’) 


A(r,t) = TON (6-2-5) 
clr — r'(t’) E = mee) 


By means of these so-called Lienard-Wiechert. potentials, we may now 
calculate the electric and magnetic fields due to our small moving charges. 
Although our usual concern will be with situations where v < c, we will 
for the moment carry along all orders in v/c so that in the future we can deal 
comfortably with charges that are moving at relativistic velocities. Our 
job then becomes one of differentiating these potentials. That is, 


B(r,’) = V x a” (6-2-6) 


aan v-& 
ele — r9|(1 - : ) 
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rr 
ne re|(t 2 ) 
(e 


E(r,t) = —V 


6-3 DIFFERENTIATING THE LIENARD-WIECHERT 
POTENTIALS; THE RADIATION FIELD 


We now undertake the rather laborious task of performing the differentia- 
tion called for in Eqs. (6-2-6) and (6-2-7). The difficulty lies in the complex 
implicit dependence of all the primed terms on r and ¢. We will prepare 
ourselves with a few preliminary derivatives. 


td -3- 
t 5 (6-3-1) 
Now 
a af q or 
aw c \ or ot 
and 
dlr — r’| oe 
ar’ was 
Hence 
Ot 1 a5 
— Vv -é (6-3-2) 
ie 
c 
Similarly 
1/0 
Vi’ a a , J 
me “(4 |r r|) vs 
and hence 
aug 
vie aa0 
yee (6-3-3) 
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We will also find the following useful: 
ag’ ra) ( r—r ) _ W-é)r - Pr) v 


Or oat lr —r'| lr —r'|? 7 22 
or 
Oe’ x (& x VV) 
See od 


We can now evaluate 


7) : v8 — ve 
alle ras | = =) 


= r-e[t-e -<fe x (& x v’)] 


or 
, 7 ne — r’la’. $’ 12 
valet ( -"*)| al GY iy , 
ot c c c 
which yields 
é : V-& lr—-r|., , Bey rete 
valet (\- *}]- F (a &’) v (« ~) 
(6-3-5) 
where 
dy’ 
‘= 6-3-6 
= (6-3-6) 
We are now ready to take the gradient of 9: 
op ,, 
—-Vo ae AVP) coax a Or Vt 
qé" — v'/c) 
SC a mT AT! 
|r — rP(1 - ) 
c 
do q ; é | ; v-& 
= = oe Bs 1 
, , 7\2 or |r r’| 
at = (1 v es ) 
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Combining our terms, we have 
&y1 — = 1-—-—-&@ 
al ( e € c 


=Vo = ae 
ee 
c 


Next we calculate 


_ 1A _ 1 aA or 
Con C TOLre CL 
waa a av 
c at at ames “) 
t 
—qa’ qv 


|- lee (a’-&) —v'- G = “| 
(e (é 


or, rewriting slightly, 
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Combining —Vg and — i =. we have 
—q[a’ — (a’-8’)e’] + 98’ x (« x “) 
K(r,f) = ————— —eee 
S c7|r — r'| jvm Tal 
SS eee Cc 
¢ 2 
q\ & — — (: i= “) 
+ ve: (6-3-9) 
lr —r’|?(1 - ) 
cS 


Next, we calculate the magnetic field B = V x A: 


VxA= (V x Aen ad (3) x Vr 


Now 


Pa Vv 
af * al aes 
(V x A):const a) a 


(6-3-10) 


ed Bays le en 
aA —qga'+qé «(2 x “| w|* (« )] 
_ a Tema Te y] me og: 2 
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and 


Vv 
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We finally have then 


¥ Al v’? i Af if ve 
qe i ae teed) ORs 
B= + & x vy -é’\3 
Ajr-ri(1- ) 

C 


(6-3-11) 


Examining our expressions for E and B, we note that they each divide 
naturally into two parts—that which is independent of acceleration and 
drops off as 1/|r — r’|? and that which is proportional to the acceleration 
of the charge and drops only as 1/|r — r’|. The terms independent of a 
can, as expected, be obtained directly by carrying out a Lorentz transforma- 
tion upon the electric field of a static charge g. The terms proportional to a 
are the ones which interest us now; they are the so-called radiation fields. 
We thus will ignore all terms that are proportional to 1/|r — r’|? in our 
present discussion. 

We first note that both E and B are perpendicular to é’ and to each 
other. This is characteristic of the so-called transverse nature of electro- 
magnetic radiation. The magnitude of E is the same as the magnitude of B— 
a highly desirable result since otherwise we could transform either E or B 
away by an appropriate Lorentz transformation. 

We also note the remarkable fact that this electromagnetic field seems 
to be propagating with time at a velocity c. That is, a given acceleration 
at time ¢’ leads to fields at position r and at time ¢ such that |r — r’|/c = 
t — t’. As ¢ increases, the relevant field appears further and further away 
from where the acceleration was that gave rise to it. 


/ 


6-4 ENERGY RADIATION: NONRELATIVISTIC TREATMENT 


To see what happens insofar as the flow of energy is concerned, we can 
evaluate the Poynting vector S. 


~ 


ae une , Ce ae 
S=7 Ex B=7_Exé€ x E)= 7 Be (6-4-1) 

We will make the nonrelativistic approximation that the velocity 
of our charge is much less than c. This assumption serves to simplify our 
work considerably and is a perfectly valid assumption in all the applications 
we will be considering. 

To evaluate S, we must first have another look at E. We have, for 
ie a) 


(6-4-2) 
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where 
oe eel , \ ff 
a,=a’ — (a’-&)é (6-4-3) 


As we can see, aj is the projected acceleration perpendicular to the line 
of sight &’. [Intuitively one should think of this as follows. The electric 
. field is directly proportional (with proportionality constant —gq/c*) to the 
acceleration that we would “‘see’’ as we watch the particle. The acceleration 
we see must obviously appear as it was at an earlier time ¢’. Furthermore, 
its apparent magnitude goes down as 1/|r — r’| as we go further and further 
away from the location of the acceleration. Finally, what we would see 
is just the projected acceleration, that part which is at right angles to our 
line of sight. This ‘intuitive’ approach to understanding the radiation 
field due to an accelerating charge will be particularly useful when we begin 
to examine coherent interference effects due to the organized motion of 
large assemblies of such charges. | 
We wish to calculate the total energy radiated per unit time by a charge 
with acceleration a’ at time ¢’. Its effect is felt at a distance R away at a 
time R/c later. We draw a sphere of that radius about our charge, as shown 
in Fig. 6-3. The magnitude of the electric field at (R,6) is just 


qa’ sin 8 
E(R,9)| = —3z—- 6-4-4 
E(R,0)| = >, (6-4-4) 
and hence 
Pati?) es? 
q’a’* sin’ 6 
.( 6-4-5 
is| = 2 (6-4-5) 
2 a ‘ 
\ 
\ 
\ 
\ 
\ 
\ 
iv 
0 22a \ ' 
Pe Se Fig. 6-3 We evaluate the en- 
ergy lost by an accelerating 


of the Poynting vector around a 


\ 

\ 

\ particle by integrating the flux 
{ 
| sphere of radius R. 


222 LET THERE BE LIGHT! 


The total amount of energy which was radiated per unit at time ¢’ is then 


GaGa 
—_— = 2x sin? @) dé 
di ~“*4ne* Np ( ) 

5) Te a? 

3. a 

This expression will be encountered quite often in studying the electro- 
magnetic manifestations of particle acceleration and is known as the Larmor 
formula. 


(6-4-6) 


6-5 POLARIZATION 


One of the immediate consequences of the transverse nature of electro- 
magnetic radiation is the fact that only two polarization states are available 
for each radiation direction. That is, we need to choose only two orthogonal 
unit vectors, at right angles to &’, and we shall then be able to express our 
electric radiation field entirely in terms of components along these two 
vectors. 

To illustrate then the origin of the various types of optical polarization 
we have learned about in elementary physics, it is useful to examine in 
detail the radiation pattern due to a charge moving in a circle, withy « c. 
In Fig. 6-4 we sketch a coordinate system relative to the center of the circular 
path taken by the charge. 

The charge moves counterclockwise as viewed from the +z direction. 
As we observe the charge from the various directions, we see the acceleration 


Fig. 6-4 A charged particle moving in a circle gives rise to circularly 
polarized radiation along the axis of its motion and linearly polarized 
radiation at right angles to this axis. In other directions the “‘light’’ is 
elliptically polarized. 
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vector carrying out its periodic mction with the rotational frequency of the 
particle. From along the +z axis, the acceleration vector always appears to 
have the same length and rotates about in a circle. Hence we have circularly 
polarized light going out along the positive and negative z directions. 

At any given point along the +z axis, the electric field vector appears 

q to rotate in a counterclockwise manner as we look toward the charge. We 
‘will call this right-hand circularly polarized radiation, since if the right thumb 
points in the direction of propagation (along the +z axis), the fingers 
follow the direction in which E changes with time. We note, of course, 
that if we explore the pattern of E along the + « axis at a fixed time, it looks 
like a left-handed screw. 

Along the negative z axis the electric field has the spatial distribution 
corresponding to a right-handed screw and rotates at any given point in a 
clockwise manner as we look toward the charge. Naturally, this is called 
left-hand circularly polarized radiation. 

In the xy plane the radiation is plane polarized with the electric 
vector always lying in this plane. 


6-6 THE SCATTERING OF RADIATION BY A FREE ELECTRON 


If our radiation neld is now incident upon a charge, it will obviously ‘‘shake”’ 
it. That is to say, the electric field will cause the charge to accelerate and this 
in turn will cause the charge itself to radiate. In this way we can say that the 
charge has effectively ‘“‘scattered” some of the incident radiation. It is 
remarkably simple for us to calculate the fraction of the incident energy 
per square centimeter at the charge that is reradiated in this manner. This 
fraction is called the Thomson cross section when evaluated for an electron 
in the limit that its velocity is considered small compared with c. (The 
term cross section is a natural one to use when asking for the fraction of 
the energy flux per square centimeter that is affected by a particle. We 
imagine that we replace the particle with a disk of a given cross-sectional 
area. If the radiation passes through it, it is affected; otherwise it is not. 
The symbol for cross section is invariably a.) 


energy radiated by electron per unit time 


OThomson = magnitude of Poynting vector of incident radiation 
(6-6-1) 
Now, the acceleration of the electron is just given by 
ed (6-6-2) 


m, mM, 
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and hence the energy radiated by the electron per unit time is just 


du _ 2e7a? oe e* E? 
dt 302 3 mc 


(6-6-3) 


The magnitude of the Poynting vector for the incident radiation is given 
by : 


|S| = ra E? (6-6-4) 
Thus we have 
8x et 8x 
OThomson = =e a, 24 = Can ro” (6-6-5) 


where we define r, = classical radius of electron = e?/m,c?. Incidentally, 
we note that the classical radius of the electron is of the same order as the 
radius the electron would have if its mass were largely electromagnetic in 
origin (see page 200). 


6-7 MATHEMATICAL SUPPLEMENT: COMPLETENESS 
AND ORTHOGONALITY 


As we become more and more sophisticated in our study of physics, we will 
find increasingly more numerous the occasions when we make use of com- 
plete sets of orthogonal functions. (In fact, when we come to the study of 
quantum mechanics, we will discover that the sets of solutions to given 
boundary-value problems are all this type.) We have already come across 
one such set of functions, the Legendre polynomials P,(cos 6) that we 
studied in electrostatics. Other common examples are the harmonic func- 
tions sin kx and cos kx and the Bessel functions. Let us then explain in 
general terms what we mean by completeness and orthogonality. 

A set of functions g,(x) will be considered complete over a given 
range of x if any reasonable’ function f(x) defined over this range can be 
expanded as a linear sum: 


fs) = ¥ Aa) (6-7-1) 


The functions will be considered mutually orthogonal over the range x,,;, < 


min 


‘For a precise definition of what we call reasonableness the reader is referred to any standard 
mathematical text covering functional analysis and eigenfunction expansions. All functions 
which we are likely to come across in physics are reasonable. 
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| gi(xg(x)dx=0 fori4s (6-7-2) 
. where g; is the complex conjugate of g;. 
Sees 


6-8 MATHEMATICAL SUPPLEMENT: FOURIER 
SERIES AND FOURIER INTEGRAL 


Perhaps the most useful complete set are the harmonic functions sin kx 
and cos kx. We begin by considering only the interval 0 < x < a over 
which we wish to approximate the function f(x). Since we do not care about 
J(x) outside this interval, we are free to take it periodic with a periodicity 
given by 


I(x) = f(x + a) 


Thus we need only make use of those values of k which make sin kx and 
cos kx periodic with this same period. Thus 


ka = 2nn where n is an integer 


We have then, for our expansion, 
foo) 4 oo 2 
f(x) = ¥ A, sin a xe y, B, cos Bil (6-8-1) 
n=0 a n=0 a 


That the individual functions are orthogonal can be seen from inspection: 


9 2nn . 2nm 
sin ——x sin ——x dx = 0 unless n = m 
0 a a 


I 
= 


2 2nn 2am 
| cos ——x cos ——x dx = 0 unless n 
a a 


si 2an 2am 
| sin ——x cos ——x dx = 0 for all n,m 
a 4 
That they form a complete set can only be asserted at this time and not 


proven. For a detailed proof the reader is again referred to the mathematical 
literature. Our problem then becomes simply one of finding the coefficients of 


sin path and cos ae We multiply both sides of our expansion for f(x) 
a a 


. 2am ; 
by sin ——-x and integrate from 0 to a. 
a 
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a lee) a p: : : 
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+ EB, | cos x dx 
n=0 0 
a 
gia 
2 
We have then 
Ee 2 
Ae al fd) sin x dx (6-8-2) 
a jo a 
Similarly 
2e {e 2 
pe 2 | f(x) cos ee x dx (6-8-3) 
aN; a 


As the range of x over which we wish to consider the function f(x) becomes 
infinite, our series naturally becomes an integral. We write 


= i ‘ A(k) sin kx dk + \, B(k) cos kx dk (6-8-4) 
0 0 


where A(k) dk is the contribution to our “sum” from the interval between 
k and k + dk. We remember though that 
eikx ae e7 ikx f eikx a e7 ikx 
cos kx = —3 and sin kx — eo 
i 


Substituting, we have 
Ps ; co 
Y 0 


Both A(k) and B(k) must be real if f(x) is to be real. Let C(k) = ./n/2 
[ B(k) — iA(k)]. Then we have 


1 Ke ; 1 2 : 
Sa C(k)e"* dk + —— *(k)e” *™* dk 
I(x) a (k)e + “all C (ke 


If we change variables in the second integral, from k to —k, and define 
C(—k) = C’(k), we can combine our terms to obtain 


co 


foe 1" Gee ae (6-8-5) 


To obtain C(k) if we are given f(x) we proceed in a manner analogous to 
that used to obtain A, and B, in the case of Fourier series. We multiply 
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both sides by e~ ** and integrate over x from — 0 to oo. 


y Ie * d= C(k) dk [" eueae™ dx (6-8-6) 


= oe 


We now wish to evaluate the integral 


foe) a 
| eh egy = lim | ema 
— oO ao —a 


1 ; ; 
= li ik-k’)a _ ,-i(k—k’)a 
ay ae 
So a* , 
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Let us have a look at the function [2 sin(k — k’)a]/(k — k’) as we let a get 
very large. In Figs. 6-5 and 6-6 we plot the function for a = 1 anda = 10. 
As is apparent, the function becomes more and more peaked at k = k’ 
as we let a increase. However, the total area in the peak remains constant 
and has a value of about 42a(2n/a) = 2n, as can be seen from examining 
the graphs. (The actual area is in fact exactly 27.) Now, as we take a larger 
and larger, we can begin to ignore the function outside the region k = k’, 
inasmuch as it will average to zero in any integral. We define the function 


2 sin(k — ka 
Wo — 7 


(6 is called the Dirac 6 function.) And we have then, for any integral including 
the peak, 


(6-8-7) 


2nd(k — k’) = 1 


ao 


k'+ & 
Ok — k) dk =1 


Ke 


We return then to evaluate 


| fde** dx = = | C(k)2nd(k — k’) dk 
- SOR 
= /2n Ck’) 


Fig. 6-5 A graph of the function [2 sin(k — k’)a]/(k — k’) for a = 1. 


y 


, 


oo <r 


228 LET THERE BE LIGHT! 
Hence we have 
1 2 ie 
C(k’) =_ Tin | f@e=* ax (6-8-8) 
TR = 05 


The functions C(k) and f(x) are called Fourier transforms of each other. 


Fig. 6-6 A graph of the function [2 sin(k — k’)a]/(k — k’) fora = 10. 
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6-9 THE INTERACTION OF RADIATION WITH A 
CHARGE IN A HARMONIC POTENTIAL 


We consider now in detail the “scattering” of radiation by a charge g ina 
harmonic potential. We will assume that the incident radiation is traveling 
in the +z direction and is polarized with its electric vector along the x 
direction. This latter assumption involves no loss of generality for, as we 
shall see, it will be possible to superimpose the effects due to a superposition 
of incident radiation. We begin with a very simple incoming plane wave 


E,, = E, cos(kz — ot) where w = kc 


At z = 0, we can write 
Eom 
= ee ee) (6-9-1) 


We assume the restoring force on the charge to be equal to mw”. In addition 
we assume a resistive force proportional to the velocity of the charge. 
Hence our differential equation for x, the displacement of the charge from 
its equilibrium position, becomes 
d*x dx Gina, 
_ ae ie eae eT a a 6-9-2) 
de) dt 2m ( ) ( 
We assume x to have the form x = 4e'® + Be~'®' and substitute back 
into our differential equation. 


ar wager = 


Let iene 2) GE | ico 
E aw + iyw + @o*) mat 


7 gE. —iw 
+ | =o — iy@ + Wo”) — ol. al) 


For this to be true at any time we must have 


qE qe 
= B= ——_>__ 6-9-3 
2m(@” — w* + iyo) 2m(w? — w* — iyo) a 
We have then 
we gE eiot gE eit 
2m(Wo2 — w* + iyo) 2m(@2 — w* — iy) 
qEo eilot—) ie e lot—¢) 
i m/(@o? — wo)? + yor 5) 
2 cos(wt — 9) (6-9-4) 
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where 


yo 
tang = —;——_; 6-9-5 
iia ae (62925) 
Now, we are not quite finished, since we can add to x any function x’ which 
is a solution to the differential equation 


— + w?x' = (6-9-6) 


The sum x + x’ will also be a solution to our original equation. The solution 
for x’ is well known. It is 


| y | y 
x =e /2 (4 sin /@? — as Bcos_/@)? - r 1) (6-9-7) 


where A and B are arbitrary. Inasmuch as this solution is exponentially 
damped with time constant 2/y, we can effectively ignore it in most cases. 

Now we wish to find the fraction of incident energy per unit area 
which is reradiated or scattered by the bound charge. Again we make 
use of the Larmor formula for the instantaneous rate at which energy is 
radiated. 


Pra = a 3 a (6-9-8) 
But 

ax —qE,w* 3 

a? mJ@e-o)+ par om 
Hence 


_ 2 - gE,’ o* cos? (wt — 9) 
ad 3 m3[(wo? aS w*)? ie yo" } 


aU 


(6-9-10) 


The average radiated power [remembering that the average value of 
cos?(wt — g) is 4] is just 
1 Gg Eg** 


mt FS Oly? — oF? + a] oan 


The average incident power per unit area is just 
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We have then, remembering that ry = q?/mc’, 


De 4 
Viney 8m, @ 


Oscattering = ii = a Yo (0,2 — o)? + Po 
Now, we might ask, what if the incoming radiation were not just a pure 
harmonic function of one frequency? We would like to make use of the 
power of Fourier analysis to achieve a solution to the problem. Let us 
assume again that our charge is at z = 0 and that the electric field at z = 0 
is in the x direction. Let E,(t) be the field at the charge. Then we can write 


(6-9-12) 


Ey = E(w)e' daw (6-9-13) 


i 


We wish to solve the differential equation 


ax dx . gE,(t) 
ae +y oo + Wx = aT (6-9-14) 
If we let 
eek | G(w)ei”! daw (6-9-15) 
a / Stee oo 
then we obtain 
- . é 
=. | {ilo + iyo + @7] - g SO} dw = 0 
ele — os m 
(6-9-16) 
If we wish this equation to be true for any value of t, we must have 
gé(w) 
= 6-9-17 
GO) = a? — o + ya) ae 


We are thus in possession of the Fourier transform of x, which, upon 
integration, yields x directly: 


Sites wall | ee (6-9-18) 
m./2n J —@ (Wo? — 7) + iyo 


To complete our solution we need only differentiate Eq. (6-9-18) twice and 
insert the result into the Larmor formula. 
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6-1. Consider a classical electron going in a circular orbit around a proton with an 
initial radius equal to ro. 
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6-2. 


6-3. 


6-4, 


6-5. 


6-6. 


6-7. 
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(a) Derive an expression for the radius r of the electron as a function of time. 

(b) Calculate the time it would take for the electron to spiral into the proton if 
it started out at a radius of 107° cm. (This is actually typical of the time it 
takes for an electron to go from an excited state to the ground state.) 


A nonrelativistic charged particle is brought to rest from an initial velocity v9 
by means of a constant acceleration ay. That is, 


Vo fort < 0 
Vv, =<{U — at for0 St S vo/ay 
0 for t 2 vo/ap 


What is the frequency distribution of the radiation seen by an observer at a great 
distance from the particle? 


A relativistic particle is slowed down by means of a constant acceleration a 
directed opposite to its direction of motion. Plot the laboratory intensity per unit 
solid angle of the radiated energy as a function of cos 6 for 8 = 0.6, 0.8, and 
0.95. The angle 6 lies between the direction of motion of the particle and the 
direction of radiation in the laboratory. 


Derive expressions for the electromagnetic fields E and B corresponding to a 
point charge moving with constant velocity v by transforming from the rest 
system of the charge into the laboratory frame of reference. Compare your 
results with Eqs. (6-3-9) and (6-3-11) as evaluated in the case where a’ = 0. 


Consider a charge on the end of a spring moving according to the differential 

equation 

d*x 

a = + W’x = 0 

where W,) >> »/2. It starts out with some amplitude which then damps to zero 

exponentially. 

(a) Find the frequency spectrum which characterizes the emitted electric and 
magnetic fields. 

(6) Plot the intensity of radiation as a function of frequency in the neighborhood 
of wo. How does the width of the frequency distribution relate to the dis- 
sipation constant 7? 

(c) What is the ratio of the mechanical energy stored in the system to the mechani- 
cal energy lost per cycle? (This number is called the Q of the system.) 


Suppose that an oscillating charge is emitting electromagnetic radiation at a 
frequency wo. We turn on a receiver and investigate this radiation for a time period 
T. If we Fourier analyze the intensity of the received radiation, we will find that 
it is largely confined to a frequency range Aw in the vicinity of wy. Carry out 
the Fourier transform of the observed radiation intensity and then estimate Aw. 


A pair of equal and opposite point charges with electric dipole moment p rotate 
with angular frequency w about an axis perpendicular to the line joining them. 
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Show that the energy radiated per unit time can be approximated by the expression 
du 2 p’w* 
d= 30 


in the event that the separation is much less than the wavelength of the radiation. 
Compare this with the sum of the energy which would be radiated were the two 
charges traveling separately and independently. 


Suppose for a moment that magnetic monopoles existed. What are the radiation 
fields that would result from the acceleration of a point magnetic monopole q'””? 


A magnetic dipole y is perpendicular to an applied magnetic field and precesses 
about it with frequency w. How much energy is radiated per unit time as a result 
of this precession if the wavelength corresponding to w is much greater than any 
of the physical dimensions of the dipole? [Hint: Make use of the results of 

Probs. 6-7 and 6-8. ] 


7 
The Interaction of Radiation 
with Matter 


Having understood the interaction between radiation and a simple charge, 
be it free or be it bound in a harmonic potential, we are ready to take the 
next step and go to an assembly of charges. Inasmuch as matter is made up 
of just such an assembly, we would hope to be able to demonstrate all the 
well-known principles of optics by following the electromagnetic radiation 
in detail. The kinds of questions we will answer are the ones which have 
troubled almost anyone who has studied optics—how does the light get 
turned around in a mirror, and why does the light appear to go “slower” 
in glass than in air? Just as always, our approach will be the physical one 
of saying that currents produce fields which shake charges which produce 
234 
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more fields which shake other charges, ad infinitum. The beauty will lie 
in the rather simple way in which everything will fall together. 

Now the actual situation within material at the microscopic level is 
quite complicated, and in the end we will need the power of quantum 
mechanics to deal with it exactly. There are numerous things we would 
have need to worry about including the absorption and reemission of 
radiation by atoms, the scattering of radiation by “free” electrons, the 
magnetic polarization of the atom, and even the electron spin. Our ap- 
proach will be to solve several simple and idealized examples exactly and 
then develop a formalism so that when we have finally solved the micro- 
scopic problem exactly, we will be able to find the macroscopic properties 
of matter directly. The general expressions that we will derive will have 
far-reaching applicability (even into the realm of elementary particle 
physics), and even the simple models we take will come remarkably close 
to reality. 


7-1 THE ABSORPTION AND REFLECTION OF RADIATION BY AN 
IDEALIZED CONDUCTING SHEET WITH NO MAGNETIZATION 


We begin with the simplest of systems, radiation striking normally onto 
an idealized conducting sheet with conductivity o. The sheet is assumed to 
have a magnetic permeability of unity; that is, no magnetic dipole moment is 
present anywhere. Our experience tells us that very little, if any, of the 
radiation is transmitted through the sheet. We also know that a large part 
of the radiation may be refiected back into the direction from which it 
came. We would like to understand these results in some detail. 

As before we will assume that the origin of the incoming radiation 
is some accelerating charge a very long distance away. We can then con- 
sider that our incoming wave is a plane wave; the loci of points of equal 
field form planes perpendicular to the direction of propagation. As we 
just learned, we can subdivide the incoming wave into harmonic compo- 
nents and superimpose the results later; this permits us to use a simple 
incoming wave of the form’ 


E,(z,1) = Eye7t#- 
Byg,t) = Ege 


where w = kc. We will, for convenience, take E, = B, = 0. 
Now to begin our treatment of the problem, we place a sheet of 
conductor of infinitesimal thickness in the path of the radiation, as shown 


(a1) 


1Our work will be immeasurably simplified by making use of the exponential in this way. If 
the true incoming wave has the form E,cos(kz — wt), then we need consider only the real 
parts of Eq. (7-1-1) and of all equations for the electromagnetic waves which derive from it. 
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in Fig. 7-1. We choose the thickness 6 to be much less than 22/k, the wave- 
length of the radiation, and small enough so that the amount of radiation 
produced by the sheet is infinitesimal when compared with the incoming 
wave. We further assume that the current density induced in the conductor 
at any point is directly proportional to the electric field at that point. 
(Needless to say the proportionality constant may be frequency dependent. 
However at any given frequency, the assumption of proportionality be- 
tween current density and electric field is a reasonable one.) Hence we 
write, at z = 0, 

ern ict 

Jx aE e ( 7. 8 2) 

Je ag oe = 0 

We now find the induced vector potential due to j, at some point 
on the positive z axis at a distance z to the right of the sheet. (By induced 
we mean not present in the original incoming field but produced through 
the mediation of the current in the conducting sheet.) Using cylindrical 
coordinates, we write 

iw(t — R/c) 


2n ?max(8) 
A,'"4(0,0,2,4) = 6 | d6 | ile emementnone (7-1-3) 
0 0 R 


Fig. 7-1 A thin sheet of conductor of thickness 6(6 < 2z/k) is placed perpendicular to 
incoming radiation. Its conductivity is o. 


y out of paper 


Incoming beam of radiation 
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However, r? + z? = R?, and, since z is fixed, we can write 
2r dr = 2RdR (7-1-4) 


Substituting back into Eq. (7-1-3), we obtain 
5 2n Rmax(6) 
boE,e'™ | do | e *® dR 
0 iz 


r) E iwt (2 ‘ ; 
| de MRate _ ¢- tx) (7-1-5) 
19) 


ind 
A, 


Now the term e7 *®=sx) varies extremely rapidly with R,,.,(9), taking on 
every possible phase if R,,,,(0) varies as much as a wavelength. Hence we 
can ignore this term; it will average out to zero. This leads us to the result 


2ndaE 


ind —_ 
A (z,t) = ik 


eee) a forz = 0 (7-1-6) 
When we began, we chose our z axis arbitrarily, requiring only that it 
lie within our incoming beam of radiation. Hence the result we obtained 
in Eq. (7-1-6) is true for any point that lies a distance z to the right of the 
conducting sheet. provided that it is within the original beam. If we choose 
a point which is outside the original beam of radiation, we would have to 
replace z in Eq. (7-1-5) by R,,:;,(9) and the exponential would average out 
to zero. Hence there is no induced vector potential outside the original beam. 

Now we can have a look at the induced vector potential to the left 
of the conducting sheet (z < 0). The complete symmetry of the situation 
requires that 


At Zz) =F A,(z) (7-1-7) 


Hence we can write 


2ndoE,e**+™ 


A,(z,t) = i forz <0 (7-1-8) 
We now proceed to find the induced electric and magnetic fields. 
| 7m 
ind =) ee x 
Ey (z, t) “és Cc Ot 
—Insakge -°) fer za 0 
= ; (7-1-9) 
— 276d Ege *™ for z < 0 


(We have made use of the fact that k = w/c in obtaining the above expres- 
sions.) 


Bee. ay (V x AS); 
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4 a a 

~ @z 
—2ndcE,e""-™ forz>0 

Z : (7-1-10) 
200E, eo for z < 0 


We should rnake a number of observations about the induced fields. 
First, going off to the right of the sheet is an induced plane wave propor- 
tional to and exactly 180° out of phase with the incident wave. When added 
to the incident wave, it diminishes its amplitude by a factor 1 — 2206. 
Obviously, E and B remain mutually perpendicular, and the Poynting 
vector is still in the +z direction. Second, going off to the left of the sheet 
is a reflected plane wave. The electric field in this induced wave at position 
—z is identical with the electric field in the induced wave at position +z. 
The induced magnetic fields, on the other band, are equal and opposite at 
equal distances to the left and right of the sheet. The Poynting vector 
associated with the reflected plane wave points naturally in the —z direction. 
Finally we note that the amplitudes of the induced fields are independent 
of frequency (provided, of course, that 6 < A = 2n/k). Any frequency 
dependence which we discover when dealing with a thick conducting plate 
must result then from the interference between the fields generated by 
layers at various depths below the surface. 

Before proceeding, let us have a look at the energy balance within 
the thin sheet. We would like to show that the energy coming in per unit 
time minus the energy leaving per unit time is just equal to the ohmic heating 
of the sheet. Immediately to the right of the sheet we have 


E(+0,) = E,°(0,1) + £,'"(+0,2) G-1-1) 


where +0 refers to “0” approached from the right. Using Eq. (7-1-9), 
we find 


E,(+0,) = E,°°0,)(1 — 2nd0) (7-1-12) 
Similarly, using Eq. (7-1-10), we obtain 
B,(+0,2) = B,i"°(0,2)(1 — 2x50) (7-1-13) 


Combining these results, we derive the Poynting vector just to the right 
of the sheet. 


S,(+0,t) = —— E,'*°(0,1) B,'®°(0,1) (1 — 4n6o + 422620) 
4n 2 


Ignoring terms of order 5”, we have then 


S,(+0,0) = S,i"(0,2)(1 — 4260) 
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= (1 = 4n50)LE, "(0,07 


energy leaving unit area of sheet, to the right, per unit time 
(7-1-14) 


The amount of energy leaving to the left per unit time per unit area is 
proportional to 6? and can thus be ignored. The rate at which energy is 
being delivered to the sheet per unit area is then 

ca =O. [ees (Ost),|? (7-1-15) 
This is just equal to the rate of ohmic heating for a volume 6 of material 
[see Eq. (4-1-6)]. 

Now that we know how to find the radiation from an infinitesimally 
thin slice of conducting sheet all of which has the same current density 
jx, We can proceed to the case of a thick conducting plate being irradiated 
normal to one face by a plane wave of frequency w. For convenience we will 
first allow the conductor to extend from z = 0 to z = oo (see Fig. 7-2). 
As before, the incoming radiation will have the form 


E 0) = cree) 
x 


B30) = Eee eo) (7-1-16) 


Fig. 7-2. Incoming radiation is normally incident upon a semi-infinite 
conducting sheet with conductivity o. 


dz' 
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Naturally, we can slice our conductor into an infinitude of individual, 
infinitesimally thick slices and add together each of their contributions. 
A typical such slice, of thickness dz’, is shown in Fig. 7-2. We are interested 
in finding the fields at a distance z from the face of the conductor, and our 
typical slice is taken at a distance z’ from that face. 

Unfortunately, however, the field which is shaking the charges at z’ 
is precisely the field we would like to find. We can no longer make the 
approximation that this is equal to the incoming field. Hence we allow the 
function E,(z,t) to be an unknown. Inasmuch as our incoming plane wave 
has frequency w, we shall assume that 


E,(z,t) = E,(zje (W117) 


Now if our typical slice dz’ at z’ is to the left of z (z = z’), then its con- 
tribution to the field at z is just given by [see Eq. (7-1-9) ] 


GE" (2,1) <ee— 2rakee je az (7-1-18) 
On the other hand, if z’ > z, then the contribution of dz’ to the field at z is 
dE, !""(z;1) = —2nek, (2 )e*e-2 1 de’ (7-1-19) 


The total field at z is given by a sum of three terms, the incident field, the 
contributions from slices to the left of z, and the contributions from slices 
to the right of z. Thus we have 


z 
E, (z)e™ = Ey e ikzgiot _ Ingeivt | E,(z’)e~ ik(z—2") dz’ 
0 
. 2 * 
—2noei { E, (z’)e*™@-2dz’ 
z 


Canceling out e and rewriting slightly, we have the basic integral equation 
for E,.(z): 


iE (Z) = Eve” ke ane | sEA2 je dz’ 
0 


—2nael** | E,(z’)e7 ™' dz’ (7-1-20) 


Similarly, we find the basic integral equation for B,(z) by adding together 
the three parts, the incoming plane wave, the contribution from slices to 
the left of z, and the contribution from slices to the right of z [(see Eq. 
(7-1-10)]. 


B,(z) = Ege” — 2noe~™ | Bee az 
0 


+ 2nce'* | E,(ze""™"' dz’ (7-1-21) 


z 
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Now we address ourselves to finding the solution to Eqs. (7-1-20) 
and (7-1-21). The simplest way to proceed is to differentiate E,(z) twice 
with respect to z, obtaining a differential equation. 


= = —ikE je” + 2noike™ i Egle’ )e**" dz 
— 2naike'* im ee je Pode’ 
ae = —k®E,e" + Inok*e7 [eseve™ dz’ 
+ 2nok*e'* [- E,(z/)e7#" dz’ + 4noikE,(z) 
or ' 
_ = (—k? + 4noik)E,(2) (7-1-2) 
We also note that 
Oi) «ix, (7-1-23) 


Differentiating B,(z) we will find that it satisfies the same differential equation 
as E,(z). In any case it will be much simpler to just calculate dE, /dz and 
divide by — ik to find B,. 

Before proceeding, we will make a slight digression. The sophisticated 
student will observe immediately that we could have obtained these dif- 
ferential equations much more rapidly if we had started with Maxwell’s 
equations. We would however have lost all our beautiful insight into the 
origin of the fields in terms of flowing currents. Nevertheless we shall do 
so now if for no other reason than to provide the student with an easily 
remembered way of rapidly obtaining the appropriate coefficients in the 
differential equation. We start with 


1 OE 
ec ee ee 
c ot c of 
1 
vee 
c Ot 


Taking the curl of the second equation and remembering that in this case 
V-E = 4zp = 0, we have 

V x (V x E) = V(V-E) — VE 
1 


= -WE= — en 
c oat 
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1 oE 1 6?E 
= — —|4nr0 — + — — 7-1-24 
(400 5 + =m) ( ) 
If we search for a solution of the form 
E,(z,t)= Eze ‘:and . E, =Ep=0 
we obtain for our differential equation 


PE, 4no. w? 
<a =~ DE, ~ y E, 


= (4ncik — k?)E, 
just as before. Taking the curl of V x B will obtain for us the same equation 
for B,. 
We return to the problem at hand, the solution of which is now 
apparent. In fact there are two possible solutions: 


E(@) = exp (—./4naik — k?z) (725) 
exp (+./4noik — k?z) (7-1-26) 


Our first problem is how do we evaluate ./4naik — k?? The answer will 
be immediately apparent if we just write 4ncik — k? = ./k* + (4nok)? e’®. 
Then 


/4npik — k? = [k* + (4nck)* }te'?/? (7-1-27) 
The important thing to note here is that @/2 is always in the first quadrant, 
and hence ./4naik — k? has a positive real part if o has any value at all 
other than zero (see Fig. 7-3). The solution given by Eq. (7-1-26) would 
diverge as we approached infinity and hence must be rejected. We are left 
then with 


E,(z) = A exp (—./4noik — k*z) (7-1-28) 


1 


Fig. 7-3 The expression —k? + 4noik 
plotted in the complex plane. 


7-1 ABSORPTION AND REFLECTION BY A CONDUCTING SHEET 243 


We wish to find A to complete our solution to the problem. To do this we 
need only substitute back into our integral equation. 


A exp (—,/4noik — k*z) = E,e 
= anode { exp [(ik — ./4naik — k*)z’] dz’ 
0 


— anode | exp [(—ik — ./4noik — k?)z’] dz’ | (7-1-29) 


Carrying out the integrals indicated, we find 


2noA ; 
eee er atl 7-1-30 
( 9” ik — /4ncik — a) ( ) 


This leads to the result 


7 Loy A 
gee /4naik — k ik E, (7-1-31) 


210 


We have then for the field within the conductor 


E,(z,t) = eisai |e Ep exp (—./4noik — k?z) e! (7-1-32) 


210 
To find B,(z,t) we just differentiate E, with respect to z and divide by — ik. 


. -_ Fa Ss . 
B,(z,t) = ,/4noik — kK? Vell Sold 


2noik 


Ey exp (—./4naik — kz) e! (7-1-33) 


Now these expressions look quite hopeless except for the fact that, for most 
metals over a large frequency range, we can ignore k by comparison with 
4no. In that frequency range we have 


| [ke k k 
E,(z,t) = | —_ + i( —_ -x<)| Eo exp (— ./ 2nokz) 


exp[—i(./2nokz — wt)] (7-1-34) 


. ay oe 
B,(z,0) = 2|(1 = i“) ns rs 


Ey exp (—,/2n0kz) exp [—i(./2n0kz — wt)] (7-1-35) 


As the ratio of k to o becomes smaller and smaller (either better conductivity 
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or lower frequency), we can ignore k/2x0 by comparison with ./k/2o and 
./k/8no by comparison with 1. In that case we can further approximate 
E and Bas follows: 


Exe, e f= Ep, exp (—./220kz) 


exp |-i(. /2nokz — wt — *)| (7-1-36) 
B,(z,t) & 2E, exp (—./2nokz) exp [—i(\/2nokz — wt)] (7-1-37) 


The magnetic field in the conductor thus has a magnitude that is much 
larger than that of the electric field. It also lags the electric field by about 
an eighth of a cycle in the case where k/o <« 1. Both fields drop off in 
magnitude as exp (—./2n0kz). The term 1/./2z0k, called the skin depth, 
is the distance within which the fields drop by a factor of e. 

Now to round out our evaluation of the electromagnetic fields in 
this simple example, let us calculate the reflected electric and magnetic 
fields. We make use of our integral equations again and note that 


— 2naell*? + 4 | exp [—(./4naik — k? + ik)z’] dz’ 
0 
/4ncik — k? — ik , 
Ey noik k ik eitkz + ot) (7-1-38) 
/4ncik — k? + ik 
We obtain B,'*"'(z,1) through the equation 
Bee = 6,1) (7-1-39) 
It is interesting to note one other rather important point which is apparent 
from looking at the integral equations for E,(z) and B,(z). Both E,(z) 
and B,(z) are continuous at z = 0. In general the tangential components 
of the electric and magnetic fields are both continuous when going across 
a nonmagnetic boundary. This can be ascertained through an inspection 


of Maxwell’s equations, as follows. 
Faraday’s law states that 


1 do 
Reg =| — = 2 
} @ (oh 


E,*"(,1) 


If we take our curve of integration as shown in Fig. 7-4, the leg 6 can be 
taken arbitrarily small and hence ® can be made arbitrarily small. Thus 
E,(region 1) = E;(region 2). Similarly, unless there is a finite surface 
current within a layer of zero thickness, the equation 


1 oE 
Veen 47. — 
- a) 
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Boundary 
|_- between 
media 
a ime Fig. 7-4 The tangential components of E and B are 


continuous across the boundary of our nonmagnetic 
conductor. 


ensures that the tangential component of B is continuous across the 
boundary. Obviously, if we are dealing with a piece of magnetic material, 
this continuity no longer holds because the sharp charge in magnetization 
is equivalent to a finite surface current. 

These boundary conditions along with the solution to the differential 
equation for E and B in the conductor can serve to fix E and B completely 
without having recourse to the integral equation. Before we leave our 
conducting sheets, let us see what happens if we remove the restriction 
that they be of infinite thickness. We begin by treating one sheet of finite 
thickness and then extend our technique to a succession of any number of 
such sheets of varying conductivity. We take our sheet to lie between z = 0 
and z = a (see Fig. 7-5). For some point x within the sheet (0 < x < a), 
we can again express E,(x) in terms of a sum of the incoming field, the 
field contributed by currents on the left, and the field contributed by currents 


Fig. 7-5 Electromagnetic radiation strikes normal to a conducting sheet 
of thickness a and conductivity o. 


Incoming wave 
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on the right. The equation is identical to Eq. (7-1-20) except for the fact 
that oo is replaced by a. 


E,(z) = Ege"™ — 2nce™ | E, (2’)e"**’ dz’ 
0 


- anoe™ | E,(z’)e~ *' dz’ (7-1-40) 


Again E,(z) satisfies the differential equation d’E,/dz? = (—k? + 4naik)E,, 
and again we can write, in general, 


E, = Aexp(—./4noik — k*z) + Bexp(+./4noik — k?z) (7-1-41) 
This time, however, we have no right to remove the second solution 


[exp (./4naik — kz)] because we are constrained to the region 0 < z < a. 
If we substitute back into our integral equation, we find A and B by in- 
sisting that the equation hold at any two points z = 0 and z = a, for 
example. (The equation will then automatically hold at all other points in 
the conductor.) We obtain 


A+ B=E, - ano | [A exp (—./4noik — k?z’) 
0 
+ Bexp (+./4naik — k?z’)} e~' dz’ (7-1-42) 


and 
A exp (—./4noik — k?a) + Bexp(+./4naik — k?a) = Eye '** 
— 2noe™ i | [A exp (—./4naik — k?z’) 
0 
+ Bexp (+./4naik — k?z’)] el dz’ (7-1-43) 
As you can easily see, the solution to the problem, while messy, is 
quite straightforward. There are two independent equations for A and B 


in terms of Ey, o, k, and a. Having found A and B, we can proceed to find 
the reflected and transmitted waves just as before: 


ES(2,1) = —2nce ton | A exp (— /JAnaik — k?2’)e~*" dz’ 
0 
+ | Bexp(+./4naik — k?z’)e7 ** a | (7-1-44) 
0 


Ee: 1) = Byes "529 = Inae~ ke - ot) 


x I A exp (—./4noik — k?z’)ei**’ dz’ 
0 
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+ | Bexp (+./4naik — k?z’)ei’ a | (7-1-45) 

0 
Suppose now that instead of having to deal with only one sheet we had two 
sheets, as shown in Fig. 7-6, the first with conductivity o, and the second 


with conductivity o,. If we let E, ,(z,f) and E,, ,(z,2) be the solutions inside 
the two conductors, respectively, we can write, in general, 


Ehagt) = Eea(eece 
= [A exp (—,/420,ik — k?z) 
+ Bexp (+./4no,ik — k2z)]e (7-1-46) 


£3, t) = E,, (ze 


= [C exp (—./4n0,ik — k?z) 
[ p ( 2 


+ Dexp (+./4no,ik — k?z)Je (7-1-47) 


We can write down two integral equations 


Zz 
E,,:(z) = Eye™™* — 2noye™ | E, ,(ze*#" de’ 
0 
ee 
— Inge |, ce az 
Zz 


- 2nore™ | Ey, 2(z’)e~**' dz’ (7-1-48) 
b 


E,.2(z) = E,e°** = 2no,e™ | E,,4(2/)e**" de’ 
0 


Fig. 7-6 Electromagnetic radiation strikes normal to two conducting 
sheets. 


Incoming wave 
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Zz d 7" 
aces 2no,e™ | E.. (ae dz’ = ano,e™ | Ee (ae dz’ 


[ts Zz 


(7-1-49) 


If we insert Eqs. (7-1-46) and (7-1-47) for E, ,(z) and £, ,(z) into our integrals 
and evaluate Eq. (7-1-48) at z = 0,a and Eq. (7-1-49) at z = c,d, we will 
have four simultaneous equations for A,B,C,D. Thus our solution to 
E,.(z) will be complete at least within the conductors. The extension to the 
regions outside of the conductors is trivial and is left to the reader. 

We might add here that any number of parallel conducting sheets can 
be treated in this manner. For N sheets we will need 2N coefficients for the 
solutions, and the N integral equations will give them to us. 


7-2) WE ALLOW THE CONDUCTOR TO HAVE MAGNETIC PERMEABILITY 


We will now generalize slightly by allowing our conductor to have uniform 
magnetic permeability 4. We can of course retrace all our steps and ask 
what a thin sheet of magnetic-moment distribution will yield for A and 
then for E and B. It is simpler however to just make use of Maxwell’s 
equations in the same manner as we did on page 241. We now have 


vx (2) = ance +S 
mM c 


= (7-2-1) 
V = at 0B 
oO 
This leads to the equations 
2 
ren me 4 oe 
i a (7-2-2) 
4nop OB nu 07B 
2B ee Le cellent 
Md ¢ Ter ee ee 
Again we search for solutions of the form 
15s O= ES tcot 
E, = E, =0 
We find the differential equation for E, to be 
adhe ; 
= (4nopik — uk?)E, (7-2-4) 


dz? 
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The result is then 


exp i= /u(anoik — k?)z] (7-2-5) 
E,(z) = 
‘ exp [./u(4naik — k?)z] (7-2-6) 


Let us now find the complete set of solutions for the case of the semi- 
infinite slab of conductor extending from z = 0 to z = o. As before, 
we can reject solution Eq. (7-2-6) as diverging at infinity. This yields 


E,(z,t) = Aexp[—./u(4noik — k*)z]e  forz>0 (7-2-7) 


The magnetic field B, can be obtained from Maxwell’s equation: 


OE 1 OB, 
Ss — — —" = —jk 7-2-8 
oz c Ot xB, ( ) 
Hence 
1 dE, 
oa ik éz 


2 ite) NC ae 
forz >0 (7-2-9) 
The incident wave has the form 
He (2,)i= Eye Ve"o (7-2-10) 
Bueelz tie Baan (7-2-11) 


The reflected wave has the form 
B24) = Deikz + 2) (7-2-12) 
Baez, t) = — Deilkz + 21) (7-2-13) 
where D is a constant, to be determined from the boundary conditions. 
Our job is now reduced to finding A and D. We do this by requiring 


that the tangential components of H and E be continuous across the bound- 
ary. This yields 


Ey + D=A 


peepee A,/u(4naik — k?) 


ik 


(7-2-14) 
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The solutions for A and D are 
a KES (TNS) 


./ul(4noik — k*) + ik 
iku — ./u(4noik — k?) 
D= es 
iku + ./ul4naik — k?) 


Again, it is instructive to look at these solutions in the limit of large 
a. We have then 


Eo (7-2-16) 


Ci an a pr (722-17) 


/4nciku 1o 


The fields in the conductor become 


[k 
EXG;t) — E,exp(—./2nokp z) 


exp| —i( /Inokuz — wt — =) (7-2-18) 


B,(z,t) = 2E, exp (—,/2nokpz) exp[—i(,/2nokpz — wt)] (7-2-19) 


We compare with Eqs. (7-1-36) and (7-1-37) to note the approximate effect 
of the permeability pu. 


1. The skin depth is decreased by the factor \/. 
The magnetic field B, just inside the conductor’s surface is increased 
by a factor of u. 

3. The electric field just inside the conductors is increased by a factor 
CRY p22 


A= 


7-3 THE PHYSICAL ORIGIN OF THE REFRACTIVE INDEX 


We now find ourselves in the beautiful position of having developed a 
machine so powerful that we can just stand back and pile up our profits. 
When we carried out our analysis of the interaction of radiation with a 
conducting sheet we made only one simple assumption, that the current 
density j,(z) was proportional to the electric field at that point E,(z). We 
called the proportionality constant o and identified it with the conductivity 
of the material. However, nothing that we did after that implied that the 
constant had to be real (as it is in the case of free electrons in a conductor) 
or even that it had to be frequency independent. The physical arguments 
which we followed for adding together the effects of all the currents at 
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various values of z would have been the same if o had been a complex 
number with some frequency dependence. Thus we open up a wide range of 
applicability for the treatment we have just completed. In particular, we 
can deal with all situations where the local induced currents are proportional 
to the applied electric field, provided, of course, that the medium in question 
is sufficiently uniform and dense so that the variation in the number of 
scattering centers within a volume of order /° (A is the wavelength of the 
radiation) is negligible. (It is the violation of this last proviso that causes 
the sky to be blue. Why?) 

We will now begin with a rather simple model of a dielectric as a 
collection of charges, each of which is trapped within a harmonic oscillator 
potential. Using this model, we will show that the currents which flow are 
indeed proportional to the applied fields. By carrying over the expressions 
we have derived for a conductor, we will discover that dielectrics appear 
to have a classical index of refraction, and we will derive an expression for 
this index in terms of the frequency of the incident radiation, the natural 
frequency of the oscillations, and the density of electrons. We will then 
show how it is possible to generalize our results to permit us to treat a 
dielectric exactly, given a description of the microscopic forward-scattering 
amplitude. 

Consider first an individual electron of charge g and mass m in a 
harmonic oscillator potential with spring constant map’. (Its resonant fre- 
quency is Ww.) We let the incident radiation on this electron be of the form 


E,(z,t) = Eye i##-@ (7-3-1) 


If the electron is at z = 0 and it sees a resistive force equal to my dx/dt, 
then the differential equation describing its motion (see Sec. 6-9) is 


. JB 
oe + y & + Stily 7 es a got (7-3-2) 
Letting x = xe", we find 
Xo = qo (7-3-3) 


m(Wo? — w? + iyo) 


Suppose we now take a thin sheet of material made up of N such 
charges per unit volume. If we subject this sheet to an electric field E, = 
E,e', then we will induce a current density 


P pam p \e 
nom) 
GE," 


q \; is 
= (5) 00) pea a Say — 
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(The factor c converts j, into emu.) Hence it appears as though the material 
has an effective conductivity given by 
iNq?k 


m(w@* — w* + iyo) 2) 


ete = 
From here on, we can make direct use of all the machinery we developed 
in Sec. 7-1. Wherever we find an expression involving o, we Can USE Ger 
instead. 

Let us return then to the semi-infinite slab of dielectric extending 
from z = 0 to z = oo (see Fig. 7-2). Again we can take the incoming radia- 
tion to have the form 


E*(2)2) ‘a Dee alle 0) (7-3-6) 


There are two possible solutions for the field in the dielectric. Using Eqs. 
(7-1-25) and (7-1-26) and substituting o,,, for o, we find 


4nq*k?N 
— —— 2 an - = 
exp) k 2G, 2 | (7-3-7) 


E,(2) = 


exp] —k? — (7-3-8) 
Simple inspection will show that the radical will have a positive real part, 
and hence Eq. (7-3-8) can be removed as a possible solution. This leaves 
us with 


4nq’?k?N 
eS Se re 
M(@o* — w* + iy) 


E,(z,t) = Ae~ iho) (7-3-9) 
where 
4nq*N 
no ft aay wie 


The quantity 7 is called the index of refraction. We see that a given phase of 
the plane wave within the dielectric moves with an apparent velocity given by 


c 
Uphase = n (7-3-11) 
Going back to Eq. (7-1-31), we find A: 


- J 4 egik — k* — ik “ 


= 0 
206 ets 


_ ink — ik |, _-2@=) 
2 ns oe 
_ 2E, 
n+ 


Ey 


(F-3212) 
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We have then our final solution for the electric field in the dielectric: 


2) ee 
Lay — Ceo 7S 0) (7-3-13) 


The magnetic field B,(z, 1) can be obtained by differentiating E, with respect 
to z and dividing by —ik. 


2nEo 


— i(knz — wt) ae 
eae forz > 0 (7-3-14) 


B(z,0) = 


To obtain the reflected waves we make use of Eqs. (7-1-38) and (7-1-39): 


Ink — ik f 
E refl 2) in. E i(kz + wt) 
cael le ink + ik °° 
La keto 1 
Pay gee forz <0 (7-3-15) 
B refl not i(kz + wt) 
y (Zt) = Pes Ege forz = 0 (7-3-16) 


7-4 WHAT HAPPENS WHEN n <1? PHASE VELOCITY AND 
GROUP VELOCITY i 


We will now have a good hard look at our expression for index of refraction 
and see if we can understand something about the propagation of signals 
within the corresponding medium. To simplify our considerations we will 
set y = 0. This is not a bad approximation for most dielectrics far from 
their resonant frequencies; a detailed treatment of the behavior of a dielec- 
tric in the vicinity of its resonances is beyond the scope of this book. Under 
this assumption we can write 


4nq*N 


milo? — 0) eek 


n= /1+ 
Again we remember that our wave travels within this medium as e~**"*~ ©), 
But what does this mean? Can it be that for n < 1 (which happens whenever 
@ > @o) we have a signal traveling faster than c? This would seem to be 
in patent violation of everything we have learned when we studied relativity. 
Furthermore, it seems absurd, since we never added together anything but 
waves traveling at velocity c. 

The answer of course is very simple. Just because a wave appears to 
travel with velocity greater than c does not mean that information is moving 
with that same velocity. We will illustrate by making use of an elementary 
mechanical example. 
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Let us set out a long row of uncoupled springs as shown in Fig. 7-7. 
Each spring has a mass m hanging on it and has a spring constant equal to 
md, 7. (Hence the spring and mass will oscillate naturally at frequency wy.) 
We take the distance between springs to be 6z. 

Suppose we now pluck the first mass so as to set it into oscillation. 
A very short time dt later we pluck the second mass in exactly the same 
way. A time 6¢ after that we pluck the third mass, and so on until all the 
masses are oscillating. As we watch the springs oscillate, it will look as 
though the peak of the oscillation is moving along with a phase velocity 
equal to 6z/dt. Since dt can be made arbitrarily small, the phase velocity 
can be made arbitrarily large (see Fig. 7-7). 

So far nothing bothers us at all because no information is being 
transferred from one spring to the next. They are all moving completely 
independently. Suppose we now wanted to transfer information along the 
chain. We would have to link the masses by some means and then we would 
have to give one mass an extra pluck so that it had a bit more energy than 
the others. This extra bit of energy would then be transferred down the 
line, and we would see the various oscillators gain amplitude and then 
lose amplitude again as the information passed through them. Needless 
to say, the velocity with which the peak amplitude moved down the line 
would not in general be the same as the phase velocity. 

From looking at our mechanical model we now can extract the key 
idea. Information is transferred only when energy gets transferred, and 
hence there must somehow be a change in the world wide distribution of 
energy density. As long as we are dealing with nothing but a plane wave 
of fixed frequency extending from —0oo to +00, there is no such change 
with time. Either we have to turn the light on at some time, in which case 
we would have to track the leading edge of the light beam, or we would 
have to produce fluctuations in the energy density of our beam and watch 
those fluctuations move along. 

Consider first the problem of tracking the leading edge of a light 


Fig. 7-7 We cause a wave to propagate with phase velocity equal to dz/dt by plucking the 
masses sequentially with time interval dt between plucks (see text). No information is being 
transferred because the masses are all uncoupled. 
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beam that has just been turned on. Initially the individual electrons as 
they were struck by this light beam would exhibit all the transient behavior 
which we have so far blithely ignored. We could solve the problem exactly 
by making our light beam up out of a superposition of different frequencies 
using Fourier analysis techniques. We could then include the transient 
terms and add everything together. Needless to say, the very leading edge 
would not move faster than c. (In fact, as seems intuitively reasonable, 
it will move with a velocity just equal to c.) In any case, to solve this problem 
exactly is rather tedious and somewhat beyond our scope. 

How about producing fluctuations in the energy density of our electro- 
magnetic wave and watching those fluctuations propagate? We can do 
this rather easily by adding together two waves with frequencies that lie 
very close to one another. We let x = kn and use frequencies w and wm + dw 
which correspond, respectively, to x and x + dk. We have then 


E.(z, t) Eee + E,; e7 ilk + dx)z—(w + do)t] 


=e ike-On( B+ E,e7 axe dandy (7-4-2) 


As can be readily seen, the energy density corresponding to E,, varies with z 
and has an apparent wavelength equal to 2x/dx. The peaks in energy also 
move with velocity equal to dw/dx. This velocity is the one we are interested 
in evaluating and is called the group velocity v,. 


— dw 
dk 
cn 
= a eo 
’ 4nq? No * Ge) 


mo? — wo")? 
Inspection shows that v, is always less than c. When @ > Wo, the 
index n is less than 1 and the denominator is greater than 1. For m < @p, 
the denominator is always greater than the numerator divided by c. In 
the limit that we can ignore w? compared with w,’, we have 
(se 


eee abe phase velocity 


Finally, before we leave these harmonic oscillators, let us see how reasonable 
our expression for index of refraction is. We would not have much use for 
it if it yielded a value of 1000 in the visible when everyone knows that for 
most transparent materials with a density of about | gram/cm? the index 
is about 1.5. 

First we note that w, is typically in the ultraviolet for a material 
like glass. Let us take wy = 2 x 10'° rad/sec and w = 4 x 10*° rad/sec. 
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Let us take N ~ Avogadro’s number = 6 x 107°. We know that 


2 
< = (c’) (classical electron radius ro) 


= (9 x 107°)(2.8 x 107*9) 
We have then 


, 4n(6 x 1077)2.5 x 10°) 
Ae 10°" 


Not bad for a crude guess with a crude model. 


7-5 THE INDEX OF REFRACTION IN TERMS OF THE 
FORWARD-SCATTERING AMPLITUDE 


Unfortunately, the real world is not made up of a collection of simple har- 
monic oscillators, and we would never be able to handle a real problem 
unless we developed some techniques which had broad practical applicabil- 
ity. It turns out to be quite simple to do so. We will assume that the micro- 
scopic problem of the interaction of radiation with a typical atom has been 
solved and show how we can go from there to an expression for the index 
of refraction of macroscopic matter made up of these atoms. 

We will assume that the entire atom is coherent insofar as the fre- 
quencies that interest us are concerned. By this we mean that the atom 
is so much smaller than the wavelength of light that we need not pay any 
attention to its physical dimension. Now if we allow a plane wave of radiation 
to fall upon the atom, it will absorb and reradiate some of it in all directions. 
In particular, some of it will be reradiated in the same direction as the in- 
coming radiation, and in interfering with the latter will lead to the effective 
index of refraction. 

It is clear that our index of refraction should depend then on the 
number of atoms per unit volume, on the frequency of the incident light, 
and on the ratio of the amplitude of the forward-scattered radiation to the 
amplitude of the incoming radiation. To make the connection we will 
resort to a very simple trick. We will replace our atom with a classical 
accelerating charge that leads to exactly the same amplitude of forwardly 
scattered radiation. We will then make use of what we have learned to 
calculate the effective conductivity for an assembly of these classical charges. 
Having done this, we can immediately write down an expression for the 
index of refraction. 
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To begin with, let us define the so-called forward-scattering amplitude 
Ff (0) for a single atom, as follows. 


amplitude of forwardly emitted radiation 
f(0) = at point | cm directly forward of scatterer 
~~ amplitude that incident radiation would have 
at that same point in absence of scatterer 


(It is of course assumed that the incoming radiation and the forwardly 
emitted radiation have the same polarization. We shall shortly learn how 
to deal with the more general case where this assumption is removed.) 
Suppose now that we had an incident plane wave given by 


E,(z,t) = Eye i#-™ (7-5-1) 


Imagine now that we place our classical charge at z = 0 and allow it to 
have an acceleration given by 


g=a,e'" (7-5-2) 


At a distance z in front of the charge we would see a field due to this accelera- 
tion and equal to [see Eq. (6-4-2) ] 


oe ia(t — z/c) 
E,(due to charge) = ae 
Gz 
oat — i(kz — wt) 
= (7-5-3) 
Coz. 
The forward-scattering amplitude will then be given by 
— 40 
= —— 7-5-4 
fO'= ae (7-5-4) 


In other words, if we know /(0), then we can replace our atom with a charge 
whose acceleration is just given by 


ge 
= =eSOE eit (7-5-5) 
q a 
The velocity of the charge is just 
“2 
= ic! fOEo eit (7-5-6) 
wq 


If we make up a macroscopic sheet of these charges with N charges per 
unit volume, we have a current density given by 
q_ _ Nif(0)Eoe™ 


eo ae 7-5-7 
jp = Noy = (7-5-1) 
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The effective conductivity is then 
Nif(0) (7-5-8) 


Cee = je 


Substituting back into Eq. (7-1-25) we find 


4nNf(0) 

ie 
This expression relates the index of refraction to the atomic forward- 
scattering amplitude (0) and the number of atoms per unit volume, N. 
The expression is exact for wavelengths much larger than atomic dimen- 
sions and permits us to solve the macroscopic problem as soon as we have 
solved the microscopic problem. 


hie (7-5-9) 


7-6 THE FARADAY EFFECT 


Of the various phenomena of physical optics none is more beautifully 
illustrative of the techniques and ideas we have developed than the Faraday 
effect. Let us apply an external magnetic field to a dielectric and then 
introduce plane-polarized radiation into it along the direction of the magnetic 
field. We will find that the polarization direction will rotate through an angle 
proportional to the distance we penetrate into the medium. The ratio of 
angle turned to distance traversed will depend approximately linearly on 
the magnetic field for reasonable fields. When evaluated for a unit magnetic 
field the ratio is called the Verdet constant. We will find in addition that 
this entire phenomenon can be understood in terms of a difference in the 
index of refraction for left- and right-hand circularly polarized light. 

We begin as usual by considering a rather simple model, an electron 
of charge q and mass m bound in a harmonic oscillator potential. We will 
assume a restoring force equal to — mw 7r where r is the vector displace- 
ment from the equilibrium position. In addition we will assume an applied 
magnetic field Bé, in the z direction and incoming radiation of the form 


Bigs) = Biggest ange May (7-6-1) 


(We use é,, &,, and é, as the unit vectors in the x, y, z directions to avoid 
confusion with other quantities labeled by i, 7, k.) 

For convenience we will place the center of our harmonic oscillator 
at the origin of our coordinate system. We set the force on the charge 
equal to its mass times its acceleration 


ax d’y . 

m dt é, + de arts] = —M@)? (xé, + yb) + qEoxe'é, 
tot A qB dy qB dx A 

+ qE oye gay + ae gles e dt &> (7-6-2) 
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Now, rearranging terms and letting w, = cyclotron frequency = gB/mc, 
we have two equations 


a + Wx — &, = = te: eo (7-6-3) 

= + Wo’) + @, = = Tay gi (7-6-4) 
Petting x = xe" and y = ype”, we find 

(@o? — w*)xo — iww.yo = os (7-6-5) 

(@9* — w)yo + iww,xy = fy (7-6-6) 


We solve for yp and x, and obtain 


= Ql(@o* — 7)Eo, + iww,Ep, | 


0 lo? — 0)? — oa?) — 
and 
Dine: 2 | en E 
Yo = gl(@o @ ) Oy 1WO), ox] (7-6-8) 


m[(ao* — w*)? — w7@,*] 


We can now proceed in one of two ways. We can calculate the currents 
in the x and y directions in terms of E, and E,, leading to effective con- 
ductivities (in this case, four numbers), and then put everything together 
as we have before. Alternatively, we can think a bit about the physical 
situation and by being clever come to a conclusion much more rapidly. 
We shall do both inasmuch as cleverness is not always a substitute for 
brute force in the world of real problems. 

First, let us be clever. We ask what would happen if the incident 
radiation were circularly polarized? Certainly the trapped electron has no 
choice but to go in a circle since there is no preferred direction defined in 
the xy plane. The magnetic field then would either push the electron toward 
the center of the circle or pull it away from the center. Thus there would be 
two different radii, depending on whether the light coming in was left-hand 
circularly polarized or right-hand circularly polarized. Reversing the 
direction of the applied field would clearly interchange the radii corre- 
sponding to right- and left-hand circular polarization. 

To test our intuition, we can now solve the problem exactly. Take 
first the case of left-hand circularly polarized light incident upon the charge. 
As we remember, this means that at a fixed value of z the electric field 
vector appears to be rotating counterclockwise as we look along the +z 
axis. How shall we express Ep, and Eo, in order that the total electric field 
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behave in this manner? Very simple. We remember that in the end we will 
be interested in only the real parts of the expressions we obtain for E, and 
E,. If we let Eo, = iE ,, then we have 


E,=E,.° and Eieeee (7-6-9) 


The real parts of E, and E, then rotate in the desired direction as a function 
of time. 
Substituting this into our expressions for x9 and yg, we have 


q. Ox 
eg en ee 7-6-10 
ae M(@o? — w* + wo,) ( ) 


= qEoy ce 
J. MW? — w? + ww,) ao) 
The charge q just follows the electric field, either in phase or 180° out of 
phase, depending on the sign of g and on the sign of wo? — w* + wa,. 
In any case the rotating charge produces its own left-hand circularly 
polarized radiation in the forward direction. The forward-scattering 
amplitude, defined as before, is just 


pli ; re: 1G 
Ons 7 ( «) Rg ee (7-6-12) 


If matter were made up of N such harmonic oscillators per unit volume, 
the index of refraction for left-hand circularly polarized light traveling in 
the direction of a magnetic field B would be 


4nNq? 


en ee 
4 M(Wo* — w* + ww,) 


(7-6-13) 

To go from left- to right-hand circularly polarized light is trivial. 
We remember that reversing the handedness of the polarization gave the 
same result as reversing the magnetic field B. Indeed, it is only the direction 
of B which establishes a handedness in the first place. Hence we can obtain 
our result for np by just letting w, go to —@,. 


nese + ae 7-6-14 
pi MO? — w* — wo,) i 
(Alternatively we could let Ej, = —iEo, and substitute in the expressions 


for x9 and yg.) At this point it is reasonable to note that ng and n, differ 
very little from what they would be with no field. Typically wo is about 
2 x 10° rad/sec and w, is 17.6 x 10° rad/sec for 1 gauss. Inasmuch as 
magnetic fields are not usually more than 10* to 10° gauss, we expect w, 
to be no more than 10!' to 101” rad/sec. Typical values of «, in the visible, 
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are in the region of 4 x 10'* rad/sec. Hence w,? is larger than wo, by a 


factor of about 10° to 10°. We are then entitled to make a linear approxima- 
tion for n,; and ng. 


i Ale 
= 62) 
= aoa 


mena (7-6-15) 


I 
= 
| 


fp = n+ —s (7-6-16) 


2n Wo? — w? 


Now that we have shown that right- and left-hand circularly polarized 
light have different indices of refraction, we would like to see what would 
happen if we began with plane-polarized light. As we have seen earlier, 
the essential characteristic of left-hand circularly polarized light propagating 
in the z direction is that the amplitude of the y component is i times the 
amplitude of the x component. Remembering that left-hand circularly 
polarized light propagates with index of refraction n,, we write 


E,(z,t) = Aen ihre -20g, 4. jem tenez— ong (7-6-17) 


where A is an arbitrary constant to be determined to fit given boundary 
conditions. Similarly 


Ep(z,t) = Be ittnne—ong _ jBe-ilinnz— ong (7-6-18) 


We now ask can we superimpose right-hand circularly polarized light 
and left-hand circularly polarized light so as to produce light which is 
linearly polarized in the x direction at z = 0? That is to say, we search for 
values of A and B so as to produce a field of the form 


E(0,t) = Enei'é, (7-6-19) 


at z = 0. Obviously this can be done by choosing A = B = E)/2 and 
adding together the two circularly polarized plane waves. We have then 


E i —tknyz —iknpz 
ce ee 3 (7-6-20) 


die i(e7 teu? < e” Maas | 
If we make the approximation 
Nr =n+ A 
(7-6-21) 
n=n—A 
then we obtain a simplified expression for E(z,?): 
E(z,t) = Eye ~°[(cos kAz)é, — (sin kAz)é& } (7-6-22) 


The polarization thus rotates from the x direction toward the —y direction 
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(for positive A), going through an angle kAz in a distance z. The Verdet 
constant is the rate of change of this angle with respect to distance, per 
unit field. 


_kA 


V 
B 


(7-6-23) 
For the particular model we have taken, we would have [see Eqs. (7-6-15) 
and (7-6-16) ] 


q ae a 


es 
mc? Wy)? — w* 2n 


rad/cm-gauss (7-6-24) 


Now, what if we had not been so clever as to see that the natural 
solution to this problem makes use of circularly polarized radiation? 
Is there some way in which we can treat the problem in a “brute force” 
manner which would lead in the end to the “‘natural”’ states for a description 
of the system? Fortunately, the answer to this question is affirmative. 
Furthermore, the technique we will develop will be of such broad generality 
that we will be able to make use of it to dea) with systems of substantial 
complexity, provided that we can solve the problem on the atomic level. 

Before generalizing, however, let us go back to our solution to x, 
and yo in terms of Eo, and Eo, [see Eqs. (7-6-7) and (7-6-8) ]. Having found 
x and y, we can determine the current densities j, and j, by differentiating 
with respect to time and multiplying by Nq/c. We have then 


iwNq fs ik Nq? (9? aa w)Eox ag iww, Eo iw 
Se Ne =, oy - cee 
ee iwNq wwe ikNq? (@o7 =z w*)Eoy ze iww, Eox iot 
Jy = Yo ae (@2 — w)? — wa,? e (7-6-26) 


We notice immediately that these equations have the general form 


af: =a pes oe a Gyr, 


| (7-6-21) 
Jp = 05, her Or, 


Indeed, this form is sufficient to describe the most general linear index of 
refraction problem where transverse fields are propagating only in the z 
direction. Let us proceed to solve this general problem; the solution to 
our specific problem of the Faraday effect will then fall right out. 

Again we make use of integral equations for the case of a semi-infinite 
slab to give us the physical insight we need to derive the differential equa- 
tions. As before, we break up the contributions to the fields E, and E, at 
some position z into those arising from the incoming fields, the currents 
to the left of z, and the currents to the right of z. The only difference is that 


7-6 THE FARADAY EFFECT 263 


both x and y currents can be caused by either x or y fields. We have then, 
canceling out the e’ term from both sides, 


EC —aepee — Ire = i, [onBZ') + aE, @) je dz’ 

— 2ne™ | ° [6,,E,(2’) + 6,,E,(2z’)Je7* dz’ (7-6-28) 
E,(2) = Eye” * — 2ne~™ {, [¢,,£.(2) +.6,,B,(2 le dz’ 

— 2ne'# ‘i [o,,E,(2’) + 6,,E,(z’)Je~* dz’ (7-6-29) 


If we differentiate twice with respect to x, we obtain the coupled differential 
equations 


dE, 
Pi = M,,E, + M,,E, 
oe (7-6-30) 
a: 2 M,,E, a5 M,,E, 
where 
M,,, = —k? + 4niko,, 
M,, = 4niko,, 
(7-6-31) 


M,,. = 4niko,, 
—k? 4+ 4nikoy, 


SS 
] 


We note that if we let y be the state given by wy = E,é, + E,é,, then we 
can write 


ay 
Ga 
where 
M,.. M,, 
M= (7-6-32) 
Me MM; 


(M is a tensor of the second rank in this two-dimensional space.) 

Let us see if we can find some w for which My = Aw and J is a con- 
stant scalar. That is, we look for a polarization state which goes into itself 
when operated upon by M. w is then called an eigenstate and A is its eigen- 
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value. We desire that 
M,,. — AE, + M,,E, = 90 
( ) ni (7-6-33) 
M,,E + (M,, — A)E, = 0 
This requires that the determinant 


M,,. — Awe 


= 0 (7-6-34) 
M,, M,, — A 
We have then, for A, the equation 
A? — A(M,, + M,,) + (M,.M,, — M,,M,,) = 0 (7-6-35) 
There are two solutions for A which we call A, and A_, respectively. 
M. M 
Ay = EE + AV MY + AM, (7-6-36) 
The two solutions corresponding to A, _ are w, _, given, respectively, by 
E, As a" M,.. 
2 = =__= 7-6-37 
E. M,, (7-6-37) 
Now yw, obeys the equation 
& 
Ot = dae (7-6-38) 
and hence 
W+(z) = Ws (0) exp (+i/ —A,2) (7-6°39) 
Similarly 
y_(z) = w_(0) exp (4i/ —4_2z) (7-6-40) 


Now for any given initial polarization (0) = Eo,é, + Eo,é, we can 
decompose (0) into w,(0) and w_(0) components. 


W(0) = Av, (0) + By_(0) (7-6-41) 


At some other value of z,t we would have then, for waves propagating 
in the +2 direction, 


W(z,t) = Aw. (0) exp [-,/—-4,2 - of] 
+ By_(0) exp [—i(./-—A_z — wt)] (7-6-42) 


In this way we obtain the solution to the general problem of radiation 
propagating within a medium. Incidentally, the indices of refraction cor- 
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responding to the two values of 4 are just 


Vv a7A4+,- (7-6-43) 


k 


We now apply this general method to the solution of our model for 
the Faraday effect. We have in that case 


ny - = 


ikNg? 2 = 2 
ikNq Wo (6) = (7-6-4) 


Me =aiie = — Kk? 4 Anik 8 es oe 
2 oi (@o* — w)* — w*o, 


= 


4nik —kNq’aw 
M,, = .—M,, =—— £ -6- 
xy yx m (a2 — wo) — wo,2 (7-6-45) 
Thus 
; 2 re 4nNk?q?(wo? — w) ie 4nNk*q? wu, 
wh m[ (wo? _ w)? ee wa,” |} i [ (wo? ea w?)? aes ww.” |m 
4nNq? 
= —k? ae 7-6-46 
[tiie creer sore a 
The two indices of refraction are 
4nNq* 
C= = 1+ [(wo? — w*) + wa, ]m (7-6-47) 
The eigenstate corresponding to A, has 
E, A, —M, 
— = —*____** —j (left-hand circular polarization) — (7-6-48) 
ie M,,y 
The eigenstate corresponding to A_ has 
, AL. —M, i : pe 
Z = aa a =—i (right-hand circular polarization) (7-6-49) 
x xy 


We have thus reproduced our earlier result in this completely general 
manner. 


7-7 WE REMOVE THE REQUIREMENT OF NORMAL INCIDENCE; 
FRESNEL’S EQUATIONS; TOTAL INTERNAL REFLECTION 


We have heretofore concerned ourselves only with situations where radia- 
tion was incident normally onto the interface between the materials. We 
will now remove this restriction and examine in detail what happens when 
the radiation has an angle of incidence equal to 0;. The reflected radiation 
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will then have an angle of reflection 6, and the transmitted radiation an 
angle of refraction 0, (see Fig. 7-8). 

We first ask how one can write down a plane wave traveling in a direc- 
tion given by a unit vector &. The plane wave will have two possible 
polarizations given by é, and é,, where we take 


é, x & = 6, 
& x eueee. 
& x 8 = 6 


As usual, we let w be the frequency of the radiation and define k = w/c. 
Then the wavelength is just given by 27/nk where n is the index of refraction. 
The phase of our plane wave must then change by 27 if we proceed along 
&, by a distance equal to 27/nk. Hence we can write, in general, 


E(r,t) = (Eosé, + Eoé.) exp [ —i(nké, +r — wt)] (7-7-1) 


Fig. 7-8 Radiation is incident upon an interface between two media, having indices of 
refraction n, and n2, respectively. The angle of incidence is 6;. 


Nn, 


é,(trans) 


é,(trans) 


é, (inc) 


é,{inc) 
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We also remember that the magnetic field within a medium is n times 
the electric field and is at right angles to it so that E x B points in the 
propagation direction. The magnetic field corresponding to Eq. (7-7-1) 
is thus 


BO.) = —n(Eo.é, — Eopé.) exp [ —i(nké,-r — wt)] (7-7-2) 

We now return to Fig. 7-8 and set up a coordinate system at some 
point on the interface. We let the z axis represented by unit vector é&, be 
normal to the interface and point from medium n, to medium n,. The 
incident radiation is taken to come from the n, side, and the incident direc- 
tion is taken to lie in the xz plane. On general grounds of symmetry, the 
directions of the reflected and transmitted radiation must also lie in the 
xz plane. 

It is now convenient to rewrite Eqs. (7-7-1) and (7-7-2) in terms of 
one standard coordinate system. We can take é, to be along the y direction 
for the incident, reflected, and transmitted radiation. Hence we will uni- 
formly set é. = &,. The component of electric field in this direction, which 
was previously called Eo,, will now be called Ey,, inasmuch as it is per- 
pendicular to the plane of incidence. As far as é, is concerned, we see that 

é, (incident) = cos 6; &; — sin 0; & 

&,(reflected) = —cos 0, é, — sin 0, &; 

é,(transmitted) = cos 6, é; — sin 6, & 


The amplitudes of electric field along these directions will be called Ep), 
rather than Ep,, to indicate that the field is parallel to the plane of incidence. 
The incident electric and magnetic fields then have the form 


E'(r,t) = [Eo.'"® & + Eo,'"° (cos 6; &; — sin 0; &)] 
e7 ilnik(z cos ; + x sin 8;)— or] (7-7-3) 
B'(r,) = —n,[Eo,i"*(cos 6; &, — sin 0; 83) — Eo) '"* & | 
e~ iltik(z cos 0; + x sin 6;)— wt] (7-7-4) 
Similarly, the reflected and transmitted plane waves have the form 
Evl(r,t) = [Ep if! 8, — Ep)" (cos 0, &, + sin 6, &3) 
e 7 ilnik(<sin 6, —z cos 8-)— ot] (7-7-5) 


Br"'(r,t) = ny[E {"' (cos 0, é, + sin 6, 8) + Eo," & | 
e7 Hnik(x sin 8, — x cos 8-)— ot] (7-7-6) 


Ens(p f) = 2 rsa é, + Eo\'""" (cos 6, €, — sin 8, é3)] 
e7 ilnzk(z cos 6 + x sin 6) — ot] (7-7-7) 
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B'*"5(s t) = —nmg[E, (cos 6, ei oir sin 0, &;) a be é, | 
a i[n,k(z cos 0; + x sin @¢) — wt] (7-7-8) 
We are now ready to apply the boundary conditions at the interface. 
We require that the tangential components of E and B be continuous. 


Since the tangential components are those along é, and é,, we can write 
(remembering that z = 0 at the boundary and factoring out the e’” term) 


Eo imeem imkxsings 4 Fp tefl imikxsin8, Fr transg—inakx sin 6, (7-7-9) 
Eo,'"° cos 6; e) inikx sin 8; as Egy cos 6. e7 inikx sin 6, 
= Egy" cos.0..e° =" (7-7-10) 
In order that these equations hold for any value of x, we must have 
n,kx sin 0; = n,kx sin 6, = n,kx sin 0, 


Thus we learn that the angle of incidence and the angle of reflection are 
equal. Furthermore, we have derived Snell’s law relating 0, to 6;. 


n, sin 6; = n, sin 0, (7-7-11) 


Rewriting Eqs. (7-7-9) and (7-7-10) and adding the equations for the tangen- 
tial components of B, we have 


inc retl == trans 
Eo’ + Eg = Egy 


(Eo4""° = Eo\"""') cos 6; = Pee cos 0, 


i (7-7-12) 
neg aa Eee cos 6; = Loe cos 6, 
ny (Eo, '"° ae Eggi") = n, Eo) '*" 
Solving these, we obtain the well-known Fresnel equations 
n, cos 6; — n, cos 8 , 
E refl (lL  eeaet E,,, ine 7-7-1 
ia n; cos 8; + n, cos 6, °* (7-7-13) 
27; ©08:0, E5,3P° 
E trans __ eed Co 7-7. 
ie n, cos 6; + n2 cos 8, (7-7-14) 
8, — n, cos 6 : 
E,, re! = ea aa LN ine 7. 
all n, cos 0, + n, cos 6, °! (7-7-15) 
27 COS Oy Ene 
Ee = —— ee (7-7-16) 


n, cos 6, + n2 cos 8; 


We notice immediately that E),"*’ can be zero at a particular angle 
of incidence. This occurs whenever 


n, cos 8; — n, cos 8, = 0 
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Inserting Snell’s law into the above, we have, for no reflection, 
sin 6; cos 6; = sin 0, cos 8, 
or 


Tt 
6,;+ 6, = zy (7-7-17) 
If 6, and 6, satisfy this condition, then 0; is called Brewster’s angle. 
Finally, let us consider what happens when we have total internal 
reflection. If n, is greater than ,, then there certainly exists a range of 


incident angles for which 


1 sin 0, = sin 0, > 1 (7-7-18) 
i) 
A little bit of thought indicates that this is no problem if we allow 0, to be 
a complex number. We find cos 0, by the usual relation 


cos 6, = +./1 — sin? 6, = +i//sin? 0, — 1 


n a 4 
SLi (=) sin? 6, — 1 (7-7-19) 
nz 


To simplify our notation we let 


ny 2 : 
one (=) sin? 6; — 1 (7-7-20) 
ny 


We then note that the phase factor e7 '"***°°s® becomes 


+tnokza 


eo inakz cosOr __ et 


We must eliminate the positive exponential inasmuch as it diverges as 


we let z go to infinity. Thus we accept only the negative alternative in 
Eq. (7-7-19). 


cos 9, = —ia (7-7-21) 
This gives us 


wanes __ 274; cos 0; F, inc 
Fo. pa a Ou 
n, cos 8; — in,a 


rer! _ 71 COS 8; + inna Eine 
Eo.” See, OL 
ny cos 6; oe N,a 
(7-7-22) 
ae 2n, cos 6; E 
Eo} = ae 0|| 
n, cos 6; — in,a 


inc 


Ett = M2 cos 6; + inya Eo)" 
oll n, cos 6; — ina 
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As we anticipate, the magnitudes of the reflected waves are identical 
to the magnitudes of the corresponding incident waves but out of phase 
with them. The transmitted electric and magnetic fields are given by 


ida ae : eel, 
EB (r;2) = | Bost _ Eee sin 0; &; + at) 
nN 


en n2kaz —inyk sin 6; x tiwt (7-7-23) 


Line 
2 


7 n2kaz— inyk sin 0; x + it (7-7-24) 


A simple inspection of the Poynting vector will show that, averaged over 
time, no energy is flowing in the z direction. The only average energy flow 
is in the x direction, as expected. The characteristic distance in the z direction 
within which the amplitude decreases by a factor e is just 


tee —E—E eee (7-7-25) 


naka 2n ./(n,/n2)? sin? 0; — 1 


We can obtain a feeling for the magnitude of 6 by taking some typical 
values for 1,, m2, and sin 0;. We let n, = 1.5 and n, = 1 (corresponding to 
a glass-air interface) and take 8; = 45°. 


bet te 0451 


2x /Z -1 


Typically then the electromagnetic radiation can extend out for a wavelength 
or so from a totally reflecting surface. 


PROBLEMS 


7-1. Find the skin depth in copper, aluminum, nonmagnetic stainless steel, and sea 
water for radiation at @ = 10° rad/sec, 10° rad/sec, and 10'° rad/sec. 


copper = 1.6 x 107’ abamperes/statvolt-cm 

Satuminum = 1.1 x 107’ abamperes/statvolt-cm 

Ostaintess steel = 0.3 x 10’ abamperes/statvolt-cm 
Osea water = 1.5 abamperes/statvolt-cm 


7-2. Referring to Fig. 7-2, calculate the time-averaged force per unit volume on the 
semi-infinite conductor by the magnetic field, as a function of z. Integrate from 
z = 0toz = o to find the average radiation pressure on the conductor. Compare 
this result with the change in momentum per unit area of the incoming and re- 
flected radiation. 
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7-3. Radiation of frequency is normally incident upon a semi-infinite slab of 
index of refraction n,, which has been coated with a thin layer of index n, (the 
thickness of the layer is 5). 


_—_—_— > § 


(a) Find a general expression for the reflected intensity as a function of , 
Ny, Nz, and 6. ; 

(b) Given ny, find a set of values for n, and 6 for which the reflected intensity 
at frequency w is zero. 


7-4. Linearly polarized light of the form E,(z,t) = E,e~'“*~@ is incident normally 
onto a material which has index of refraction mg for right-hand circularly polarized 
light and n, for left-hand circularly polarized light. Describe the reflected light 
quantitatively from the point of view of intensity and polarization. 


7-5. Electromagnetic radiation of frequency is incident on a thick conducting plate 
of conductivity o at an angle of incidence 6;. Find the intensity of the reflected 
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rr 
Incoming radiation 


7-7. 


7-8. 


éy 
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light for incident polarization parallel to and perpendicular to the plane of 
incidence. 


‘A semi-infinite slab of matter is constituted of N charges per unit volume, each 


of which is held in place by an asymmetric harmonic oscillator potential. The 
restoring force on each charge for a displacement in the xy plane is 


F = —mao,’xé, — mao,’yé, 


é, 


é, out of paper 


Left-hand circularly polarized light of frequency w is normally incident upon 
the slab, as shown. The amplitude of the incident radiation is Eg. 

(a) Find the electric field as a function of time at a distance z into the slab. 
(b) Find an expression for the reflected radiation. What is its state of polariza- 
tion? 


A magnetic field B is now applied normal to the surface of the slab in Prob. 7-6. 
What are the two indices of refraction and what polarization states do they 
correspond to if we take wo,” = 4@o,7? 


Explain how the Faraday effect can be used to determine the sign of the mobile 
charges in a block of dielectric. 


8 
Multipole Expansion of the 
Radiation Field; Some 
Further Considerations on 
the Interaction of Radiation 
with Matter; Interference 
and Diffraction 


8-1 A GENERAL STATEMENT OF THE PROBLEM 


When we studied electrostatics, we found it convenient to express the electro- 
static potential at a point r a large distance from a spatially limited charge 
distribution in terms of a power-series expansion. The expansion parameter 
was r’/r where the vector r’ denoted the position of charge contributing 
to the potential. This sort of expansion is not particularly useful when we 
deal with radiation because as we have already learned, the radiation fields 
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must decrease as 1/r in order that there be a net energy outflow from the 
radiator. There is however another parameter which is of prime importance, 
the size of the source as compared with the wavelength of the radiation. 
It is clear that if the size of the source is comparable to the radiation wave- 
length, then various parts of the source will interfere coherently. 

We shall take then the point of view that r’ is sufficiently less than r 
so that only electric and magnetic fields decreasing as 1/r must be considered. 
Subject to this assumption we will develop a rather general method for 
finding the fields and then make the further approximation that r’ is much 
smaller than the wavelength of the radiation. In the event that the latter 
assumption is not true, we can in many cases subdivide the source to the 
point where it is true and add the results later. This technique will be used 
when we reexamine the index of refraction and when we study diffraction 
phenomena. 

For convenience we will concern ourselves only with charge and current 
distributions which are harmonically periodic in their time dependence. 
(If the time dependence of our actual problem does not fit this criterion, 
we can decompose it into harmonic components by means of Fourier 
analysis.) . 

If w is the angular frequency of the time dependence, then we write 


j@'.9 = j@e 
p(r’,t) = p(r’)e™ 


(8-1-1) 


Substituting into our well-known expressions for A and g [see Eq. 
(5-6-3)], we have 


A(r,t) = ei | Werner dV’ (8-1-2) 
g(r,t) = e | re dV’ (8-1-3) 


where, as usual, k = w/c = 2n/A. 

We have already agreed to consider r’ to be infinitesimal compared 
with r. Hence 1/ lr — r’| varies only negligibly as we vary r’. We can replace 
it by 1/r and take it out of the integral. On the other hand, exp (—ik|r — r’|) 
can vary considerably because 7’ and A are much more comparable. We 
note then that 


lr — | = fr? +r? — Over’ 


We 
™ 
| 


(8-1-4) 
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If we let & = r/r, then we can rewrite the potentials as 


— i(kr — wt) 
A(r,t) = ——— | i() exp (ik. r’) dV’ (8-1-5) 


— i{kr — ot) 
o(r,t) = —— [oc exp (iki 1’) dV’ (8-1-6) 


So far we have only assumed that r’ < _r and have not made any 
assumption about the magnitude of 2ar’/A = kr’. Indeed, in many cases 
we can carry out the above integration completely and never have any 
reason for further approximation. On the other hand, there are many 
situations where kr’ << | and further approximation is useful. For example, 
an atom with a typical size of 107 ® cm will emit radiation with a wavelength 
of the order of 5 x 107° cm. For those situations, we are entitled to write 


exp (ikfi-r’) = 1 + ikfi-r’ — 4k7(f-r’)? + °°: (8-1-7) 
Substituting into the above equations yields a power-series expansion 
for A and 9: 
A=A,+A,+A3;+-°°- 
C= Daas + 
where 


e 7 ikr— ot) . 
Ay = —_— [ie dv’ 


e 7 ilkr— ot) 
A, ae = ik fire or’) dV’ 


and so forth. 
We could now proceed to find the electric field by taking —Vo — 


= = and the magnetic field by taking V x A. It is much more convenient 
e 


however to make the following observations: 
E(r,d) = E(rje’™ 


(8-1-8) 
Bir, t) = Bare 


, 1 OE 
Hence, making use of Maxwell’s equation V x B = ae we have 


E(r) = xv x B(r) 


l 
= Fal x [V x A(r)] (8-1-9) 
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This obviates the necessity of knowing @ at all and simplifies our job 
considerably. 


8-2 ELECTRIC DIPOLE RADIATION 
As a first approximation we can ignore kr’ completely. We then find 


e 7 ikr— ot) ae . 
A@,) = —— | i) av (8-2-1) 


Wecan cast this into a somewhat more elegant form by a bit of manipulation. 
We evaluate the x component of f j(r’) dV’: 


20) dV’ = fie -€, dV’ 
= {ie Vx! dV’ 
= |v - xj") dv’ — | x'V! + i(r’) dV’ (8-2-2) 


Note that the first of the integrals on the right side of Eq. (8-2-2) can 
be turned into an integral over a surface enclosing the current distribution 
completely. On that surface j = 0, and hence we conclude 


| ia = — [xv jr’) dV’ 


Generalizing to three dimensions, we obtain 


{ie dVi= — fetv - jr’) ] dv’ (8-2-3) 
Next, we remember the conservation of charge equation: 
: 1 dp(r’,t) 
Vv’ : st eee 
je.) oe 


Hence, using Eq. (8-1-1), we have 
V’-j@’) = — ikp(r) (8-2-4) 


Substituting back into Eqs. (8-2-3) and (8-2-1), we obtain our expression 
for A,(r,2): 


ik — i(kr — ot) 
A, (r,t) = me [roe ave (8-2-5) 
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The integral now has a very familiar form. Consider the time-dependent 
electric dipole moment of the charge distribution. 


p(t) = [roe.o dv’ 


= [roe dV’ 
= pee (8-2-6) 
Clearly then 
ee iter~ on) 
A,(r,t) = Sa ee (8-2-7) 


Finally we must determine B and E. We take the curl of A but ignore the 
term which goes as 1/r?. 


B, =Vx A, 


ikVe~ i(kr — wt) 


r 


lle 


X Po 


k2 en i{kr — ot) 
=, x Po) (8-2-8) 


To find E we take the curl of B and again ignore the 1/r? term. 


1 
E, = x B) 


ke 
= ayes) x (EX po) 


ikr 
ae k2e7 i(kr — wt) 
= <x (@ x po) 
k2 e7 ier ot) 
eee“ ame [Po — (Po fi) A] (8-2-9) 


A little bit of thought allows us to rewrite Eqs. (8-2-8) and (8-2-9) in 
slightly different form: 


k? r 
E,(r,4) = “gee (: a “) 


B,(7,) = fi x E,(r,d 


where p,(t — r/c) is the component of the dipole moment perpendicular to 


(8-2-10) 
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fi and evaluated at the time ¢ — r/c. Physically then, to find the electric 

field at r, we put our eye there and observe the charge distribution as it 

appears to us at that instant. (Naturally we are really looking at the charge 

distribution as it was a time r/c in the past.) We calculate the electric dipole 

moment of this observed distribution, take the component perpendicular to 

our line of sight, multiply by k?, divide by r, and we have our answer. 
We would like to compare this result with the one we found earlier 

for the case of an accelerating charge [see Eq. (6-4-2)]. Imagine that we 

have a charge g undergoing harmonic motion of the form 

=e 

Then the acceleration is just 
a= —w’re™ 

On the other hand, the electric dipole moment is given by 
P() = gr(t) = groe’™ 


Hence 
q 
P(t) a oe w? a(t) 


Using Eq. (6-4-2), we have 


_ ~9ap(t — r/c) 
cr 


E(r,1) 


= wpp(t — r/c) 
- cr 


ie k* pp(t — r/c) 
: 


This confirms the general result of Eq. (8-2-10) for the simple case of a 
harmonically oscillating point charge. 

The Poynting vector can now be evaluated and averaged over one 
cycle. This would usually require that we find the real parts of E and B, 
respectively, then take their vector product and finally do the time averaging. 
We can save ourselves some effort however by making the following observa- 
tions: 


CS) ine ay = 


Fe ' B+B 
An 5) 5) time av 


= 7 <Ex B+E xB’ +E’ x B+ E* x B’)simeay 


8-2 ELECTRIC DIPOLE RADIATION 279 


Now E x:B is proportional to e?'', which is zero when averaged over 
time. Similarly E" x B" is proportional to e~7'', which also averages to 
zero. Hence 


c 
Sie meee 
< Witeat l6zx <E x B F E x BD) cay 


= ie Re <E x B ime 
8x 


Since E x B* has no time dependence we need not actually carry out any 
time averaging and we can write directly 
é 


Sine ay — 8x 


Re (E x B’) (8-2-11) 


Applying this general result in the case of our dipole we have 


c k*|po|? sin? @ 


<S +f) timeav = an a (8-2-12) 
where 
cos § = a Po 
Po 


Integrating over a sphere, we find the average amount of energy 
radiated by the dipole per unit time: 


4 4 2n T 
du _ ck*|po| | au | sin? 6 dO 
0 


dt 81 0 


4 2 
_ ck" |pol* (8-2-13) 
S 
Lastly, let us apply what we have just learned and calculate the radia- 
tion field due to the simple short dipole antenna shown in Fig. 8-1. We 
assume that the overall length of the antenna is a and thata <« 4. 
The current in the antenna is taken to be 


11 — =) for z’ > 9 
Kz’, t) = 


(8-2-14) 
wo + ae jem for z’ <0 
a 


In this model then, the current is in the same direction in each half of the 
antenna and falls off linearly as we approach the ends. 
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a/2 


a/2 Fig. 8-1 We calculate the radiation field 


due to a short dipole antenna of length a. 
We assume that a < A where A is the 
wavelength of the radiation. 


To obtain the dipole moment, we can either find p(z’) or we can just 
make use of Eq. (8-2-1) directly. It is simpler to do the latter. 


po = i [ile av 
“el a) | ee 
z Lae (8-2-15) 
The angular distribution of radiated power is just 
= — (ka)? I? sin? 6 (8-2-16) 


Thus for a given maximum current, the average radiated power 
increases as the square of the frequency in the situation where a << J. 
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In the next order of approximation we consider the second term in the 
expansion of exp (ikfi-r’) [see Eq. (8-1-7)]. Substituting into Eq. (8-1-6), 
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we obtain 


—i(kr— wt) 


A(t, = a ik | j@)(@-r) dv’ (8-3-1) 
It is convenient to break the integrand up as follows: 
ji-r) = 3[j@-r’) — ’G-f)] + 4h@-r) + rG- A] 
= 2[f x Gx r)] + 2[i(-r) + r(a-j)] (8-3-2) 


The part of the vector potential corresponding to the first term on the right 
of Eq. (8-3-2) will be called, for reasons which will become apparent shortly, 
the magnetic dipole potential A,,,. The part corresponding to the second 
term will be called the electric quadrupole potential A,,. We examine the 
magnetic dipole potential first. 


— i(kr — wt) , ers , 
mei) = —ik<—— x | — (8-3-3) 


The integral on the right is nothing other than the magnetic moment of 
the current distribution. That is, 


H(t) = Moe’ (8-3-4) 
where 
Ho = 4 | r x jr’) dV’ (8-3-5) 
Hence 
— i(kr — wt) 
Asn(tst) = —ik—— ft tg (8-3-6) 


Taking the curl of A,, and ignoring all terms which go as 1/r?, we find the 
magnetic and electric fields. 
Bom = Vx Aom 


—i(kr — wt) 


= =——1 x (A X po) 


ile 


en i(kr — wt) 


=k? Sa [Ho — (Ho > A)A] 
ke r 
= — ———— 8-3-7 
= wp (0- =) (8-3-1) 
1 
E,n = al x Ban, 


= BS, (8-3-8) 
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Note the amazing similarity to the expressions we obtained for electric 
dipole radiation. The magnetic field here has the same relation to the pro- 
jected apparent magnetic dipole moment as the electric field has to the 
projected apparent electric dipole moment. 

An example of such a magnetic dipole is the simple circular current 
loop shown in Fig. 8-2. We take the loop to have circumference a and let 
the current in the loop be given by 


Kt) = Ine (8-3-9) 


(In this case the total length of wire involved in the antenna is the same as 
it is for our previous electric dipole.) We will again calculate the average 
energy radiated per unit time per unit solid angle and compare it directly 
with what was obtained in the case of the comparable electric dipole [see 
Eq. (8-2-15)]. The magnetic moment of the loop is just 

Z 


a 
Ho = Ip Tee (8-3-10) 


Hence 


Pu 
(2 “) lime av 7 rs : A ne av 


— ore ——— (ka)*Ig? sin? 0 (8-3-11) 


It is instructive then to compare this result with Eq. (8-2-16). We have, for 
comparable currents and a comparable size, 


(8-3-12) 


eu 
dt dQ mag dipole __ (ka)? a (4) 


( du ) An? 
dt dQ elect dipole 


The intensity of magnetic dipole radiation is thus characteristically 
smaller by a factor of about (a/A)* than that of electric dipole radiation. 


Fig. 8-2. Asimplecurrent loop 
with current I(t) = Ibe’ emits 
magnetic dipole radiation. 
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(This point is of considerable significance in analyzing the radiation of 
atomic states. When we get to study atomic transitions, we will find that 
some of them cannot take place through the medium of electric dipole 
radiation and hence are appropriately suppressed.) 
We next return to the so-called electric quadrupole potential. 
e7 ilkr~ at) 


Ajg(t,t) = ik oe {ua “r’) + r(f-j)] dV’ (8-3-13) 
Consider the first part of the integral. We will examine its x component. 
[nc -r)dV’ = {o or )j-é, dV’ 
= fo © jh Viera, 
= \v [x(a rj] dv’ — [xv -[-r)j] dv’ 


We eliminate the first of these two integrals by converting it into a 
surface integral and by noting that 7 = 0 on the surface. Hence 


{1.0 dV = — [xv [(a-r)j] av’ 


— [xva “r)-jdV’ — {xa -r)V’-jdVv’ 

But 
V’(a-r’) =f 

and, remembering Eq. (8-2-4), 
V’- jr’) = —ikp(’) 

We have thus 
ec “rydV’ = — [xo -j) dV’ + ik [xa -r’)p(r’) dV’ 
Generalizing to three dimensions, we conclude that 
[i “r)dV’ = — [ro -j) dV’ + ie |re -r’)p(r’) dV’ (8-3-14) 
Substituting back into Eq. (8-3-13), we find for A,, 


— i(kr — wt) 
A, (r,t) = —k? re -r’)p(r’) dV’ (8-3-15) 
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To proceed, we will take the curl of A,, and find the magnetic field. 
Again we ignore terms of order 1/r?. 


B,, — V x Arg 
ie ihre) 


= a x fro-re) dv’ (8-3-16) 


We will add to the expression on the right of Eq. (8-3-16) a term which 
is zero but which helps us put the field in a somewhat more conventional 
form. We have 


ike 7 ikkr— ot) 
7a — _ = 
The integral on the right side of Eq. (8-3-17) can now be written as the 
product of a second-rank tensor QO and the vector fi: 


B, fi x | [3r’(A-r’) — Ar’? ]p(’) dV’ (8-3-17) 


[3r'(f-r’) — fir’*]p(r’) dV’ = QA (8-3-18) 
where the elements of O are 
Qi; = ox — r? 6,,)p(') dV’ (8-3-19) 
The magnetic field B,, is thus 
-1,3,,— i(kr— ot) 
B,, = aa anal VP (8-3-20) 
6r 
As usual the electric field is 
E,, = —fl x B,, (8-3-21) 


The tensor Q is called the quadrupole tensor of the charge distribution. 
In the event that the charge distribution is cylindrically symmetrical, we 
can calculate the elements Q;, very easily. Letting Q) be the quadrupole 
moment of the charge distribution [see Eq. (2-14-20) ],’ we have 


Q;; = 0 fori + j 


(@) 3 74 
Qi = 222 = \( = 5 ) or) dv’ 


= 3a (8-3-22) 


‘Our definition of quadrupole moment differs by a factor of 2 from that occurring in many 
texts. Quite often, the quadrupole moment of a symmetrical distribution is directly defined 
as Q33. 
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Q33 = [ow — r)p(r’) dV’ 
= 205 


We can calculate the distribution of radiated power as a function 
of position as follows: 


du 7 
G he av _ Me Pe av 


ck® 


ac |a x Ofl? (8-3-23) 


Unfortunately this expression is rather complex and will not be evaluated 
in general. For the particular case of axial symmetry we have 


Qh = —Qonx8; i Qonyé, ae 2Qon,€3 (8-3-24) 
|a x Qfl? = 9n,2(n,? + n,7)Qo? = 9Qo? cos? 6 sin? 6 (8-3-25) 
6 2 
( an = met © cos? @ sin? 6 (8-3-26) 
timeav 


As a simple example of such a quadrupole radiator we consider the 
assembly of charges shown in Fig. 8-3. The charge —2q is stationary at 
the origin of our coordinate system, and two positive charges oscillate 
harmonically, each with amplitude d, about the origin. The two positive 


Fig. 8-3 As an example of a quadrupole 
radiator we consider the assembly of 
charges shown above. The negative charge 
is stationary at the origin of the coordinate 
system. The two positive charges oscillate 
harmonically with amplitude d along the 
z axis and about the origin. The radiation 
pattern is indicated. 
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charges are always on opposite sides of the origin and exchange positions 


every half-cycle. 
To calculate the quadrupole moment Q, we assume that at ¢ = 0 
the charges are at their maximum amplitude. Then 


On= | ocew?P, (cos 6’) dV’ 


= 29d" (8-3-27) 
The distribution of radiation is gotten from Eq. (8-3-26). 
du CST 8a OM 
-_ = 8-3-28 
( it i) ea cos* @ sin* 6 ( ) 


Integrating over the angle, we can find the total energy lost per unit 
time, first for the general symmetrical quadrupole and then for the special 
case we have just discussed. 


d ck°Qo? 
( 5) Z (8-3-29) 
time av 
For our special example, 
du ck®°d*q? 
Gi) = (8-3-30) 
time av 


It is instructive to compare this with the dipole radiation which would 
result if one of the positive charges and half the negative charge were re- 
moved. We would then have a dipole moment |p| = gd, and, using Eq. 
(8-2-12), we would obtain 


4 2 
G) gia (8-3-31) 
dt elect dipole 3 


Comparing directly, we have 


(a) 
dt electquadrupole __ (kd)? (8-3-32) 


(=) ~  § 
dt elect dipole 


This result is very similar to Eq. (8-3-12), indicating, as anticipated, that 
electric quadrupole and magnetic dipole radiation have the same basic 
strength. Each is weaker than electric dipole radiation by a factor about 
equal to the ratio of the square of the characteristic size of the radiator 
to the square of the wavelength of the radiation. 

Obtaining higher terms in the multipole expansion of the radiation 
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field is beyond the scope of this book. The reader is referred to advanced 
texts’ where more sophisticated mathematical methods are applied. 


8-4 WE REEXAMINE THE PASSAGE OF RADIATION THROUGH MATTER 


We will now make use of what we have learned about electric dipole radia- 
tion and reexamine some of the formalism we developed in Chap. 7 from 
a new point of view. We observe that an incoming electric field induces a 
dipole moment per unit volume in matter. This induced dipole moment, 
which has the same harmonic time dependence as the incident electric 
field, gives rise to a radiation field of its own. When we add together all 
contributions to the electric field at a point, we will again observe the 
characteristic interference which led us-to the notion of refractive index. 

We will begin by letting P(r,f) be the polarization per unit volume 
of the matter in question. Then, since we will only be concerned with oscil- 
lations having frequency w, we can write 


P(r,t) = P(rje'™ (8-4-1) 


Consider then a thin sheet of material of uniform polarization and thick- 
ness 6 (see Fig. 8-4). We wish to find the electric field at a point which lies 
a distance z fromm the sheet. To make matters simple we will assume that P 


‘For example, see J. D. Jackson, “Classical Electrodynamics,” pp. 538-577, Wiley, New 
York, 1962. 


Fig. 8-4 An incoming beam of radiation causes an induced dipole 
moment per unit volume P(‘) = Pe in a sheet of thickness 6. We are 
interested in finding the field due to this polarization at a distance z from 
the sheet. 


y out of paper 
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lies along the x axis. Later we can generalize somewhat to an arbitrary direc- 
tion for P in the xy plane. 

In carrying out the integration over the sheet, we will use the same 
technique as was applied in Sec. 7-1. We will consider the sheet.to be limited 
in extent in order that we be able to consider the projected dipole moment 
to be equal to the full dipole moment. Making use of Eq. (8-2-10), we 
integrate to find the electric field resulting from the sheet. 


e~ kR— ot) 
E,(z,t) = k?P | ay 
sheet R 
; 2n Pmax(9) ,—ikKR d 
= k*Péel | ad a 
0 0 R 


: 2n Rmax(@) ’ 
k? Pdei™ | dé | e "FE aR 
0 z 


2 


dO(e-t#Bimex0) _ g-Ike (8-4-2) 


ik Poe | 
0 


Just as we did when we dealt with Eq. (7-1-5), we ignore e7 *®==x') since 
it will average to zero. This leaves us with the simple result 


E,(z,t) = —2nikPde~i##- (8-4-3) 


We can compare this result directly with Eq. (7-1-9). The equations are 
identical if we just identify ikP with cE). To see that this is reasonable, 
let us just go back to Eqs. (8-2-1) and (8-2-5) and consider a uniform current 
distribution and a unit volume. We have then, removing the integrals, 


ikP = j (8-4-4) 


But j = oE and hence our result. 

Now it often happens that P is related to the applied field by a pro- 
portionality constant y, called the electric susceptibility. This propor- 
tionality constant which was introduced earlier for the static field will, 
in general, be a frequency-dependent second-rank tensor. In our simple 
case we have taken w to be a constant number and we assume now that P 
is due to an incoming field. 


Ee (7,1) =-eoe (8-4-5) 
Thus, taking the field at z = 0, we have 
P= YE, (8-4-6) 


Substituting back into Eq. (8-4-3), we find 
E S@i) = — tnikyeBge ae) (8-4-7) 
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Again, comparing with Eq. (7-1-9), we find 
LP =e (8-4-8) 


This means that wherever we previously had o we can now substitute 
iky. In particular, going back to Eqs. (7-2-5) and (7-2-6), we find the general 
solutions for radiation traveling within this medium in the z direction. 


exp [ —ik,/p(1 + 4 
Bae xp [—ik,/u mx)z] (8-4-9) 


exp [+ik./u(1 + 4zy)z] 


We remember now that the expression 1 + 4zy was just equal to the 
dielectric constant ¢ in the case of electrostatics. We can define the dielectric 
constant at frequency w in the same manner and thus observe that the 
refractive index can be put in terms of p and «. 


n= ./u(l + 4ny) = /ue (8-4-10) 


(Needless to say, the dielectric constant and the magnetic susceptibility 
that one must use here are not the values obtained from electrostatics and 
magnetostatics. The values we need here are to be obtained at the appropriate 
frequency w.) 


8-5 INTERFERENCE PHENOMENA FROM AN ARRAY OF DISCRETE 
DIPOLES; THE NOTION OF COHERENCE 


With this section we begin a systematic study of the interference problems 
that result when we add together the radiation from a collection of coherent 
dipoles. Before we do much adding however it is wise to define precisely 
what it is that we mean by coherence, in simple physical terms. 

Suppose for a moment that we have only two dipole oscillators 
operating at exactly the same frequency. If the phase between the oscillators 
remains absolutely fixed, then the amplitude of electric field at any point 
in space will remain constant in time. Wherever the electric fields from the 
two add constructively, we will have increased intensity. Wherever they 
add destructively, we will have decreased intensity. The main point is that 
the intensity pattern averaged over a cycle of the oscillation will not vary 
with time. These two oscillators are then completely coherent with respect 
to one another. 

Suppose, on the other hand, that the two oscillators were going very 
fitfully in stops and starts so that every so often the phase between them 
would jump. If we looked at the combined electric field from the two oscil- 
lators at any given point in space, it would vary in amplitude each time such 
a jump in phase occurred. If the jumps took place in intervals which were 
short compared with the time constant of our sensing instruments, we 
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would average over them and see no net constructive or destructive inter- 
ference, just the sum of the independent intensities. If the phase jumps 
occurred in intervals which were long compared with the instrumental 
time constant, we would still see a complete interference pattern, but its 
nature would change at each jump. In any case the extent to which we will 
say that the sources are coherent will depend on the time constant of our 
instrument as compared with the time between phase jumps. 

As an example of incoherence we might consider two independent 
monochromatic light sources. Since the light from any source is composed 
of the contribution from numerous atoms and since each atom only radiates 
for a very short time, about 107° sec, the interference pattern changes 
much too rapidly to follow with our instruments. On the other hand, if 
the atoms in the two oscillators can be driven together by having them each 
scatter the radiation from a third source, then they will act completely 
coherently. When we studied the origin of the refractive index, we observed 
exactly such a coherence among the scattering centers in our medium as 
they were driven by the incident plane wave. 

As our first exercise then we examine the interference pattern at a 
great distance from a simple linear array of N parallel dipoles spaced a 
distance d apart (see Fig. 8-5). We will concern ourselves only with the 
electric field in the plane normal to the dipoles; for convenience we will 
call this the yz plane and take our dipoles in the x direction. The z axis is 
taken perpendicular to the line of dipoles. 

We take the dipole moment of the nth dipole to be p,é,. The distance 
from the first dipole to the point at which we wish to evaluate the field is r. 
Since r >> d, we can write 


ke2e7 itkr— ot) nial ; ? 
Le gre seas (8-5-1) 
n=0 
From here on, in principle, all we need to do is add together the 
complex numbers corresponding to each of the dipoles and take the real 
part of the sum. This procedure is particularly simple if all the dipoles are 


equal. In that case 


e7 ikr-ot) N~1 : ; 
>» eikndsin® (82522) 


n=0 


2 
Bacatwnes “2 


Fig. 8-5 We are interested in 
the radiation field at a position 
(r,@) in the plane normal to an 
array of dipoles. We assume 
that r is much larger than d, the 
spacing between dipoles. 


8-5 COHERENCE 291 


We can get a good idea of what takes place here by plotting the sum in the 
complex plane. If we let « = kdsin 0, then the various terms in the sum 
are vectors of unit length, each of which is rotated at angle « relative to the 
previous term. For example, taking n = 4, we have plotted the sums in 
Fig. 8-6 for various values of «. It is clear that we have minima with FE, = 0 
ata = 7/2, 2, 3n/2, Sn/2, 3x, 7/2, etc., and maxima when « is any integral 
multiple of 2x. Generalizing to arbitrary N, we observe that the pattern 


3 
Fig. 8-6 We have plotted the sum Dy e™* for various values of a. (The dashed line indicates 
n=0 


the resultant.) We obtain maxima at « = 2mn and minima at a = 7/2, 2, 3n/2, 52/2, ... 


i 


ee 


o« = 0 or 2mz (maximum) 


Tee 
aes (minimum) 
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has a series of N — | nulls between large maxima. At these maxima, 
a = kdsin 0 = 2mn (8-5-3) 


where m is an integer. Since kK = 2n/A where / is the wavelength of the 
radiations, we can rewrite Eq. (8-5-3) into more conventional form: 


md = dsin @ (8-5-4) 


These peaks of constructive interference occur then when the path length 
difference between successive dipoles is just equal to an integral number 
of wavelengths. 

We find the intensity by squaring the magnitude of FE and multiplying 
by c/87 [see Eq. (8-2-11)]. 


ck* |p|? = ink dsin®@ 
Snr? x 


n=0 
The intensity pattern for a set of four dipoles is shown in Fig. 8-7. 
The potential significance of the analysis we have just completed is apparent. 


1(r,0) = (8-5-5) 


Fig. 8-7 A plot of relative intensity versus kdsin @ for a set of four synchronous dipole 
antennas spaced a distance d apart along a line (see Fig. 8-5). 
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We have observed that radiation of a given frequency can be made more 
and more directional by increasing the number of phased antennas involved 
in the transmission. A bit of thought will indicate that by varying the phase 
among the antennas we can vary the direction in which the radiation is 
transmitted. Thus we see that a large field of phased antennas is the natural 
way of ‘“‘beaming”’ broadcasts to specific areas around the world. 

Incidentally, in a very analogous way we can consider the direction- 
ality of a set of receiving antennas. Suppose the dipoles of Fig. 8-5 had 
been set to receive rather than transmit radiation. If they were all connected 
to a common amplifier with no phase lag among them, then they would 
add completely constructively whenever the radiation was coming from 
a direction given by Eq. (8-5-4). By varying the relative phase lags from the 
antennas to the amplifier, the directions of maximum sensitivity can be 
varied. Much of radio astronomy these days is done with large fields of 
phased-antenna arrays. 


8-6 FRAUENHOFER DIFFRACTION BY A SLIT; SCATTERING 
BY A DISK; THE DIFFRACTION GRATING 


We are all qualitatively familiar with the fact that a plane wave of light 
passing through a small hole in a wall exhibits a remarkable interference 
pattern on the far side. This pattern is generally explained in terms of the 
so-called Huygen’s principle, which tells us to consider each point on a 
wavefront as a new source of radiation and add the “radiation” from all 
of the new “sources” together. Physically this makes xo sense at all. Light 
does not emit light; only accelerating charges emit light. Thus we will begin 
by throwing out Huygen’s principle completely; later we will see that it 
actually does give the right answer for the wrong reasons. 

What happens then as our radiation strikes a wall with a hole in it? 
Referring to Fig. 8-8 we see that the radiation to the right of the wall is a 
superposition of the incoming radiation and the radiation arising from the 
oscillating dipoles in the wall. If we were to fill in the hole so as to make the 
wall complete, then nothing would penetrate the wall. That is, the complete 
wall radiates just enough to completely cancel the incoming plane wave 
to the right of it. Hence the radiation which appears when the hole is open 
must be precisely canceled out by the radiation from the stopper as we cover 
up the hole. This makes our calculation of the diffraction pattern very 
simple. We need only calculate the radiation from the stopper itself. The 
radiation field to the right of the hole is equal and opposite in amplitude 
to that which would be emitted by the stopper if it were radiating all by 
itself with the same dipole moment per unit area as the rest of the screen. 

Our job then consists of two parts. First, we must find out what the 
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Fig. 8-8 An incoming plane wave strikes an 
opaque wall with a hole in it. If the “stopper” 
were inserted into the hole, then no radiation 

V————_—— 


would penetrate. Hence the stopper radiates 
just enough to cancel the radiation which 
would be transmitted through the hole. We 
find the radiation pattern on the right by 
taking the negative of the electric field which 
would be radiated by the stopper. 


dipole moment per unit area of our wall! is in terms of the incoming radia- 
tion. Second, we can replace the combination of incoming radiation and 
wall by the “equivalent’’ stopper alone. Giving this stopper a dipole moment 
per unit area which is equal and opposite to the rest of the wall, we calculate 
its radiation pattern and have the answer to our problem. 

Let 2 be the dipole moment per unit area of our wall. If the wall 
were complete, then the field contributed by its oscillating charges [see 
Eq. (8-4-3)] would just be 


Evel. t) = —2nik p e7 itkz— at) 
if the incoming radiation had the form 
Ee t) a Beg 


Then in order that the total radiation to the right of the wall be zero, we 
must have 


poe (8-6-1) 


1We assume throughout that the radiating dipole distribution in the wall has negligible thick- 
ness. In other words, the wall is made of exceedingly opaque material. We also ignore the 
displacement current at the edge of the hole or disk which results from the discontinuity in 
the current distribution. These edge effects are only important if the hole is of the same order 
of size as the wavelength. 
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We thus replace the incoming radiation and the wall with its hole by the 
stopper having dipole moment per unit area equal to — E,/2zik. If we 
let r’ refer to a point on the aperture and r be the place at which we wish 
to know the electric field, then, using Eqs. (8-2-10) and (8-6-1), we obtain 


don | xp (—iklr — r’ 
E,(r,t) ~ —2 ; sem | eaten) oy (8-6-2) 
8 aperture lr Ar 


[We assume here that the diffraction angle is relatively small so that we 
need not worry about taking the projection of the dipole moment normal 
to the propagation direction. That is the reason for writing Eq. (8-6-2) 
as an approximation. ] 

As a simple exercise we can now calculate the diffraction pattern 
from a rectangular slit in the so-called Frauenhofer limit, where the distance 
from the slit to the point of interest r is much larger than any dimension 
of the slit itself. We set the origin of our coordinate system at the center 
of the aperture with the z axis normal to the wall and pointing in the direction 
of propagation of the incoming radiation. We take the aperture to extend 
from x = —a/2 tox = +a/2 and from y = —b/2 to y = +5/2. As before, 
the incoming radiation will be taken as polarized in the x direction. To 
simplify matters we will only consider relatively smal! angles where both 
x and y are quite a bit smaller than z, making Eq. (8-6-2) valid. We have 
then 


a/2 b/2 3 ogee 
ag | jy ae 3) 


—h/2 lr —r'| 
We can take the denominator out of the integral, since it varies very little 


over the aperture. We can also approximate |r — r’| using Eq. (8-1-4). 
We have then 


IkKEy , 
E,(x,y,2,0) = = em | 


—a/2 


. -ilkr—ot) fa/2 b/2 or’ 
1S (ey cer ikEge ax | dy’ exp (i t) (8-6-4) 


2nr -a/2 —b/2 


e 


The integral splits very simply into the product of an integral over x’ and 
one over y’. Thus we have 


ikE, e7 itkr—at) 0 
Eevee h) = wae eik(sin 0x)x dx’ 


2ur a2 
( | eik(sin By)y’ ay) (8-6-5) 
= byf2 


where 


: x : y 
sin 8, = — and sin 9, = a 
r 
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Integrating, we obtain 


iabk Eye”) 


2ur 

sin (« > sin ) sin (i - sin 6, 
SS ee (8-6-6) 
4 Z 
_ Before we go on to calculate the intensity of the diffracted radiation, 
it is well to make some interesting observations. Note the phase of 90° 
between the directly forward-going diffracted field (0, = 6, = 0) and the 
field that would be present there if the wall were removed. Note also that 
the amplitude of the directly forward-going diffracted field varies inversely 
as the wavelength of the incoming radiation. The longer the wavelength, 
the wider will be the diffraction pattern and hence the weaker will be the 

intensity at the center. 

It is convenient to express the intensity of the diffraction pattern 
in terms of the amount of energy per unit solid angle at emission angles 
6, and 6,. Since the Poynting vector determines the energy passing a given 
area per unit time, we can write 


du “ r 
e =) timeray iil (s *) time av 


sn*(k > sin s sin?( k a sin 6, 


EQ 2 


sin 0, k oe sin 0, 


= |, —-——_—_——>-_ —— + 8-6-7 
: (i $ sin a.) (« ~ sin 6) — 
where 
Io = intensity per unit solid angle in forward direction 
ca? b* k? E,? 
— ese (8-6-8) 


a 
2 
in Fig. 8-9. Note that the minima occur at angles such that 


A plot of the intensity pattern versus k — sin 0, for 6, = 0 is shown 


: 2ma mA 2nn ond 
(4) = = -_— i = = -P- 
sin 6, or sin 0, kb b (8-6-9) 


where m and n are integers and A is the wavelength of the radiation. 
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Fig. 8-9 A plot of intensity versus (ka/2)sin 0, for a slit of width a. 


We will not go through the trouble of integrating the intensity over 
angle to find out how much of the incident energy makes it through the 
hole. Needless to say, all the energy which strikes the hole makes it through 
and contributes to the diffraction patiern. If we carried out the integral 
for any shape aperture, we would find 

d*u ey ee - 
7" (ara) ins = <incident energy/unit area/unit time) ime av 


angles (area of aperture) (8-6-10) 


Suppose that we replaced the wall with a hole in it (of arbitrary shape) 
by a disk which precisely matches the aperture (see Fig. 8-8). Now only a 
very small amount of energy is absorbed in the opaque disk and most of the 
radiation continues on as before in the plane wave. 

To calculate the radiation pattern we need only ascribe to the disk 
a dipole moment per unit area equal to E)/2nik [see Eq. (8-6-1)]. The 
total field is then obtained by summing the incident field and the field 
produced by the disk. 


i IkE, . — ik ee? 
E,(x,y,2,1) & Eye it#-o) — 0 got | exp(—ik|r ~ ¥')) 44, 
disk 


2n r—r 


: iIkE e7 itkr—ot) rer’ 
~ —i(kz—ot) __ Lg ag e ik dA’ 
= Ege a ip xp ( ; ) 


(8-6-11) 
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It is interesting at this point to determine the forward-scattering 
amplitude of the disk. Back on page 257 we defined the forward-scattering 
amplitude f(0) for a single scatterer. The definition here is identical; we 
take the ratio of the amplitude of forwardly emitted radiation at z = 1 cm 
to the amplitude that the incident radiation would have had at the same 
point if there were no scatterer. Hence we have in this case 


f)=- =| da’ 


=— — (area of disk) (8-6-12) 


Taking another look at Eq. (8-6-11), we see that the scattered radiation 
is identical except for a minus sign to that which we obtained when we had 
a hole rather than its complementing disk [see Eq. (8-6-2) ]. Hence, making 
use of Eq. (8-6-10), we conclude that 


Oetastic = elastic scattering cross section’ of disk 


area of disk (8-6-13) 


Hl 


Since the disk absorbs all the energy which is incident on it, we can also 
write 


Cay; = area of disk (8-6-14) 
Summing the two, we have the total cross section. 
Orotat = 2 (area of disk) (8-6-15) 


We can rewrite Eq. (8-6-12) in terms of the total cross section, in a somewhat 
weaker form than we have it. As we shall see, this will be a result which will 
prove to be independent of the opacity of the disk. 


Im f(0) = — vu (8-6-16) 

Summarizing then, we have shown that the elastic scattering cross 
section and the absorption cross section of an opaque disk are identical. 
Furthermore, we have related the total cross section to the imaginary part 
of the forward-scattering amplitude. (In the case of an opaque disk the 
forward-scattering amplitude turned out to be completely imaginary.) 
These results will turn out to be of substantial importance when we study 
the scattering and absorption of elementary particles quantum mechanically. 
Indeed, the relation between the imaginary part of the forward-scattering 


1For the definition of cross section see page 223. 
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amplitude and the total cross section goes under the name of the optical 
theorem in many quantum-mechanics texts. (The sign is often reversed 
because the sign convention for i is taken differently.) 

What, we might ask, would be the situation if the disk had not been 
completely opaque? To answer that we first go back to our complete wall 
which is radiating the plane wave represented by Eq. (8-4-3). Again, for 
convenience, we let » be the dipole moment per unit area (= Pd). In 
the case of an opaque wall we found that p= E,/2nik [see Eq. (8-6-1)]. 
We will modify this result by letting 


i nEo 
2nik 


(8-6-17) 


The electric field to the right of the wall (see Fig. 8-4) would then be 
E{z,t) = Eg — ne" ~—s forz > 0 (8-6-18) 
The fraction F of the incoming energy which is transmitted through the 
wall is just given by 
F =| —n/? (8-6-19) 


If we now make up a disk having area A of this nonopaque material, 
we would anticipate its absorption cross section o,,, to be given by 


Sars = (l at F)A (8-6-20) 


The outgoing radiation to the right of our disk would be 


i ~i(kz-ot) _ NIKE | itkr-at) tor dA’ 
E(x, y,2,t) = Ege ay e (ie exp (u : 
(8-6-21) 
The forward-scattering amplitude would be 
foy=*— me A : (8-6-22) 


Finally, the elastic cross section would be \n|? times the elastic cross section 
for an opaque disk of the same area. Hence, referring to Eq. (8-6-13), we have 


Feinstic = |"|"A (8-6-23) 
Summing the elastic and absorption cross section to find the total cross 
section, we obtain 

Cro = (1 — {1 mas n|? te \n|).A 

= 2A Ren (8-6-24) 
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Hence, the so-called optical theorem works for a nonopaque disk: 


Im f(0) = — —Ren 


(8-6-25) 


This general result was derived subject to the small-angle approxi- 
mation which we have been making all along. It happens however to be 
independent of this approximation and is completely true in general. As 
mentioned before it plays a very important role in studying the quantum- 
mechanical scattering of elementary particles. 

We now return to our wall with its aperture and replace our single 
slit with a series of N parallel slits each of height a and width b. The arrange- 
ment of the slits will be as shown in Fig. 8-10. We take the origin of our 
coordinate system at the center of the first slit. All slits extend from x = 
—a/2 to x = +a/2. The first slit extends from y = —b/2 to y = b/2. The 
nth slit then extends from y = (n — 1)d — b/2 toy = (n — 1)d + 5/2. We 
calculate the electric field exactly as we did in the case of a single slit except 


Fig. 8-10 We investigate the diffraction-interference pattern caused by 
a series of N slits each of width b that are spaced a distance d apart. 


iol of coordinate system 


bt 


| 
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that the integral over y’ consists of N different parts. 


ikE e  ilkr—ot) 'a/2 am 
j Sel 6 a eg ty av aiid |" eik(sin 05.)x dx’ 


2ar ~a/2 


N (n-—1)d+b/2 — 
( L \ ills “y) (8-6-26) 


n=1 J(n—1)d—b/2 


Our expression can be simplified somewhat by noting that 


(n— 1)d + b/2 b/2 
eik(sin Oy)y’ dy’ = eikin— 1)d sin 6, | eik(sin Oy)u du (8-6-27) 
( 


n—-1)d—b/2 Ife 
Substituting back and integrating, we find an expression for E,. 


: : ; b., 
iabE e720) sin (x + sin ,) sin (« Si sin 6, 
aS 


_2nr 


(X,),2,1) = 


any b. 
k > sin 6, k > sin 6, 


gikin —1)d sin 6 (8-6-28) 


Mz 


If we go back and compare this expression with Eqs. (8-6-2) and 
(8-6-6), we come to a very interesting conclusion. The intensity pattern 
arising from’a set of N slits spaced a distance d apart can be arrived at by 
multiplying the intensity pattern for a single slit by the intensity pattern for 
N dipole antennas with the same spacing. 


PROBLEMS 


8-1. A dipole radiator (see Fig. 8-1) is set parallel to a perfectly conducting wall. The 
distance from the dipole to the wall is 4/2 where / is the wavelength of the emitted 
radiation. What is the intensity distribution as a function of @ in a plane per- 
pendicular to the dipole at a long distance away from it? 


moment 
into paper 
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8-3. 


8-4. 
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Suppose the electric dipole radiator of Prob. 8-1 were replaced with a magnetic 
dipole radiator (see Fig. 8-2) whose moment was oriented in the same direction. 


How would the intensity distribution change, if at all? 


Consider the dipoles of Probs. 8-1 and 8-2 turned so as to have their moments 
perpendicular to the conducting plane. Evaluate the radiated intensity as a 
function of 6 at a long distance away for each of the two cases. 


A classical atom is made up of two electrons traveling in a common circle of 
radius a about a helium nucleus. (The orbits of the two electrons coincide.) 
Assume the frequency of rotation to be w where w << 2nc/a. 


- (a) Find the radiation pattern if the two electrons are always on opposite sides 
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of the circle from one another and are following one another around. How 
much energy is radiated per unit time? 


(b) 


(c) 


(b) How much energy would be radiated per unit time if the two electrons moved 
together as one doubly charged object? 

(c) How much energy would be radiated per unit time if the two electrons traveled 
in opposite directions about the circle? 


Calculate the Frauenhofer diffraction pattern that would result from a circular 
aperture of diameter D illuminated by radiation of wavelength 4. Sketch the 
pattern for the case where D = A. 
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8-6. A beam of light of wavelength 5 x 107° cm is normally incident upon a square 


8-7. 


glass plate of thickness 10~* cm and 1073 cm ona side. The glass is ideal, com- 

pletely nonabsorbing, and has refractive index 1.5. 

(a) What is the scattering cross section of the glass plate? That is, what fraction 
of the incident energy per square centimeter is removed from the beam and 
reradiated? 

(b) What is the angular distribution of the reradiated light? 

(c) What are the real and imaginary parts of the forward-scattering amplitude 
of the plate as a whole? 

(d) Suppose the phase of the scattered radiation could somehow be shifted 
relative to the incident radiation by 2/2. What would the apparent absorption 
cross section of the plate be to an observer who was measuring its forward- 
scattering amplitude? Can you think of how to produce this shift? You 
would be reinventing the phase-contrast microscope. 


As we fly at 30,000 ft, most of the atmosphere is below us. Why is it then that 
the air above us seems so much bluer than the air below us? Would the sky be 
blue if the density of the atmosphere were uniform throughout and the same as 
it is at sea level? 


9 
Waveguides and Cavities 


One of the more useful applications of what we have just learned about the 
reflection of radiation at a conducting surface is in the field of waveguides 
and cavities. A waveguide is a rather simple device. Imagine that we have a 
long, hollow tube which is a perfect mirror on the inside. If we put a light 
bulb at one end, some light will obviously be carried to the other end. 
Hence the pipe or waveguide is a means of transferring radiant energy from 
one place to another with minimal attenuation. 

Now in the pipe we have just described, we tacitly assumed that the 
pipe diameter was very much larger than the wavelength of the light. Hence 
we did not need to worry about any coherent interference effects as the light 
bounced back and forth across the pipe. If we were to take the wavelength 
to be comparable to the transverse dimension of the waveguide, we would 
observe a variety of such effects, and these indeed are largely the subjects 
of this chapter. 
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Now to make our work simple, we will limit our considerations to 
rectangular waveguides and cavities. (A cavity is a length of waveguide 
which is capped off at each end.) Without much physical difficulty, but with 
considerably mathematical difficulty, other shapes can be considered by 
the reader. We will begin by assuming the walls of our waveguide to be 
perfectly conducting. Later we will consider the effect of having only a 
finite conductivity. 
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Consider then a straight pipe with rectangular cross section whose height 
and width are a and b, respectively. We will set up our coordinate system 
at some point in the waveguide, as shown in Fig. 9-1, with the z axis pointing 
in the direction in which we want our radiation to propagate. The inside 
of the waveguide extends from x = 0 to x = a and fom y= 0toy=b. 
Now we could begin by writing down a plane. wave, letting it reflect 
each time it reaches a surface, and then adding everything together. It is 
however much simpler to start from scratch and look for solutions to 
Maxwell’s equations subject to the boundary condition that the tangential 
component of E be zero at the surface. We have 


vxBeaw 2 
c Ot 
and 
“woe = LS 
(ee (hi 


Taking the curl of the second equation, we obtain 
Vx(Vx E)=V(V-E)- VE 
1 0 1 @E 


ee ee 2 oe 


Fig. 9-1 Segment of a rectangular waveguide of cross section a x b. 
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But V- E = 0. Hence 


1 OE 
Similarly 
1 3B 
V’B cad a ru = 0 (9-1-2) 


These wave equations hold for each of the components of E and B indepen- 
dently. For example, we can write 


1 ORE!) 


2 = 
i BLOG 525 t) ce ar 


0 (9-1-3) 


Now we can search for a solution to this equation of the form 
E,(x,y,2,0) = E, QE, O)E,,(2)£.,0 


(It is possible to show that the solutions of this form make up a complete 
set; any solution to the problem can be expressed in terms of them.) Sub- 
stituting back, we have 
OY ae |. WidteS 1, abe 
T2069 Va oa Ey) dy? EX) dz 


alae 
CE) de 


Since each term in this sum involves a different and independent variable, 
we can set each equal to a constant. 


=0 (9-1-4) 


Gone Cg 
dx? a —k,,’E,, dz i —k3,7E,., 
(9-1-5) 
CD Oe ais 
dy? = —k,,’E,, dt? =S —c?k*E,, 

As usual we define w = ck. In order that Eq. (9-1-4) hold, we must have 
hag Se hoy ar bee” = k? (9-1-6) 
Now the solutions for E,, and E,, are of the form 
] am = Ai cos kx aia By sin Ky 3x 

(9-1-7) 


Ey, = Az, Cosk2,;y + By, sink yy 


In order that the radiation be traveling in the direction of increasing z, 
we need only make use of those products of E,, and E,, which combine 
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k3,z with —wt and leave out the ones which combine k,,z with +cat. 
That is to say, we keep only the terms leading to sin(k,,z — wt) and 
cos(k3,z — wt) among the various products of sin k3,z, cos k3,Z, sin wt, 
and coswt. We throw away the alternative terms sin(k,,z + wt) and 
cos(k3,z + wt). Now we can represent an arbitrary combination of sin 
(k3,z — wt) and cos(k3, — wt) by taking the real part of E),e7 317-0 
and allowing Eo, to be a complex number. We have then 


E(%,y,2,t) = Eo,(A11 cos ky,x + B,, sin k,,x)(A2, cos ky,y 
+ B,, sin k,,y) e7 317-2) (9-1-8) 
Similarly 
E,(x,y,2,t) = Eoy(Ai2 cos kx + By sin ky2x)(A22 cos k22y 
+ By, sin ky2y) e7 *327-)  (9-1-9) 
E,(x,y,2,0) = Eo,(A13 CoS ky3x + By3 sin k,3x)(A23 cos k23y 
+B, sin kysy)e "7-™ ~(9-1-10) 


The boundary conditions on the tangential components of E at x = y = 0 
tell us that 


Ay, = Aj. = Ay3 = A,3 = 0 (9-1-11) 


The requirement that V-E = 0 everywhere within the waveguide tells 
us that 


B,, = B,, =0 


kia = ky. = ki3 (=k by definition) 
ka — k22 = k23 (=k, by definition) 
ks, = Ks. = ks33 (=k, by definition). 


In addition, if we incorporate the remaining ES constants into the 
definitions of Eo,, E,, and Eo,, we have 


ky, Eo, + k2Eoy + ik3Eo, = 0 Fer) 
where 
E, = Eo, cos k,x sin k,y e7 37-9 
E, = Eo, sin k,x cos kay e7 thse 20 ae 
— i(k 3z— wt) 


ge = EQ sin kx sin kiy ig 
Finally the requirements that E, = E, = 0 at y = band that FE, = E, = 0 
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at x = a yield the restrictions 
eee (9-1-14) 


where 7 and m are integers. To find the magnetic field components B,, 
B,, and B, we take the curl of E and remember that 
1 OB 
Vx E= —-— eB 
c Ot 
Integrating the curl of E with respect to time and multiplying by —c, we 
obtain 


3 = — (nm, 2 ik, Eo )siirk eek, yer a 
(63) 
B,= — ~ (ks Eos + ikyEo,) cos k,x sin ky e7#-") == (9-115) 


B, = = (k, Eoy — k2Eo,) cos k,x cos kzy e7 37-9) 


We notice incidentally that V - B = 0 as expected. 

Now because of the constraint that k, Eo, + k,Eo, + ik3Eo, = 0, 
we cannot choose Eo,, Eo,, and Eo, arbitrarily. It is convenient to define 
two different classes of solutions for a given choice of the integers and m. 
One, called the transverse electric mode (TE), has E,, = 0. The other, called 
the transverse magnetic mode (TM), has By, = 0. Hence, for the latter, 
kK, Eoy — k2 Eo, = 0. 

We can easily show that any possible solution to the problem for a 
given m,n, and w can be written in terms of TE and TM modes. If E,,™, 
Eoy®, Eox'™, Eoy™, Eo,™ represent the coefficients corresponding to 
those modes, we require only that 


Eo, = Eo,'* + Eo,™ 
Eoy = Eoy + Eo, (9-1-16) 
Eo, = Eo. 

In addition, we have the two conditions that 
kg Box + Kk, Eo," = 0 (9-1-17) 
ko, — ksEo.% = 0 

Solving these five equations for the five unknowns, we have 


ky Eox a ky k,Eoy 


E TE Ss 
Ox Ree - hee 


9-1 THE PERFECTLY CONDUCTING, RECTANGULAR WAVEGUIDE 309 


aia ee om ky k, Eo. 


E Lee 
mY ky? + ky? 
E.™— ky7Eo, + kk, Eo, 
a kerk 
E.™ — ky’ Eoy + ky ky Box 
ky? + k,? 
ae = Eo. 
Incidentally the requirement that Eo,"*/E,,"* = —k,/k, establishes that 


Eo,'* and Eo,"* have the same complex phase. The same is true of E),™ 
and £,"™. However Eo,™ = i(k, Eo," + k,Eo,™)/k3 and is 90° out of 
phase with either Ey,™ or Eo,™. 

It is instructive at this point to have a look at some of the simpler modes. 
A mode will be written as TE,,,, or TM,,,,, depending on its type and the 
values of the integers m and n. The simplest is clearly TE9,, for which we 
have 


7 
hen )) k, = — 

1 7, 
The requirement that k,Eo,"* + k,Eo,"* = 0 permits us to conclude that 
in this case Ey, = 0. Hence 


E, = Eo, sin =) e7 ilksz— ot) 
E, it; E, = 0 
a) 
a _ 
B, = cs Eo, sin 7 e7 ilksz — at) 
k ‘ 
B, a a ioe sin = e” ilksz— et) 


If we now assume Ep, to be real, we can find the fields as they actually exist 
within the waveguide by taking the real parts of these equations. 


ES Ea sin = cos (k3z — wt) 


k att 
B, = 7 Eo, sin Te cos (kz — wt) (9-1-19) 
k, ae 
B, = — — Eo, cos —y sin (k3z — wt) 


ss k b 
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We see that the electric field just runs across from the plane at x = 0 to the 
plane at x = a. The magnetic field, on the other hand, appears to go in 
closed loops centered about the points where the electric field is zero. We 
indicate the directions of the field lines in Fig. 9-2. Solid lines represent 
electric field and dashed lines represent magnetic field. 

The logical mode to examine next would be the TMy,. We note how- 
ever that this mode as well as all modes of the form TM,,o or TMo,, are 
zero by virtue of the fact that k,Eo, = k,E , = 0 for these modes (since 
either k, or k, is zero). This means that the lowest frequency which can 
pass through the waveguide is in the transverse electric TE), (or TE;9) 
mode 


1 
— fora>b 
a 


min 


ee forb<a 


This mode, called the dominant mode, is the important one in the transmission 
of microwave power. At any frequency below the minimum value, k, will 
be imaginary and hence the radiation will be attenuated exponentially in 
going down the waveguide. 

When we examine the velocity at which radiation travels down the 
waveguide, our first reaction is that it appears to be w/k,, which is greater 
than c (since k; < k). To understand this it is convenient to think of the 
entire process as composed of the superposition of waves being reflected 
back and forth from the walls. We illustrate this process in Fig. 9-3 where 
we show only one set of wavefronts (ignoring the reflected ones to avoid 
confusion). We see that in one period t the wavefront will move a distance 
ct at right angles to itself. The observer will think that the wave has moved 
a distance v,t downstream along the waveguide, where v, = c/sin 0. The 


Fig. 9-2 Sketch of the field lines in the mode 
TEo:- 
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Fig. 9-3 Diagram of one set of wavefronts, illustrating the phase and 
group velocities and their relationship with c. 


rate at which information moves down the waveguide is just c sin 0 = 0,. 
Hence 


v,v, = c? (9-1-20) 


We observe from our mathematical expressions for E,, E,, and E, that 
v, = w/k;. The group velocity can also be obtained, as before, by dif- 
ferentiating w with respect to k;. We have 


(dei? ar ks? ae kee" = a? 


and hence 
Ogee: 
is Oe am” 


as expected. To see if all this makes sense we can transform into a frame of 
reference moving with the group velocity v, down the waveguide. We should 
then have a situation where the solutions for E,, E,, and E, should not 
depend upon z. We first examine the term k3z — wt. Our Lorentz trans- 
formation yields 


z= y(z + Bet’) 


iG +f," 


In our case 


k; @ 


——() and = —— 
PB @ CY? +k," 
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We have then 
; peeled 
k,z = y(k3z’) + ————— 
5 y¢ 3 ) ky? ee 
w*ct’ 


Ieee) == 
(60) y(k3z') ct /k,? + ky? 


Hence 
: k3z -—-ot= Be A + k,?ct’ (9-1-21) 


We see then that the term e~ "37-9 becomes exp (i,/k,? + k,” ct’) and 
the z’ dependence falls out. Since x = x’ and y = y’, the dependence on 
the x and y directions remains unchanged. The coefficients Eo,, Eoy, Eoz; 
Box, Boy, Boz are, of course, transformed like the appropriate elements 
of a second-rank tensor. Inasmuch as longitudinal fields transform into 
themselves under this transformation, the two sets of modes TE,,,, and 
TM,,,, retain their characters. 
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If we take a segment of waveguide of length d and cap it off with a sheet 
of conductor at each end, we have a rectangular cavity. We will let the 
cross-sectional dimensions of the waveguide segment be a and b as before 
and assume for the moment that the walls of the cavity are perfectly 
conducting. 

Physically we can now think of the radiation as coming to one end 
of the cavity, reflecting back, coming to the other end, and reflecting again. 
If we choose the frequency right, the radiation will add constructively each 
time around, leading, as we shall see, to a standing wave. If the frequency 
is ever so slightly off resonance, then each reflection will throw the wave 
a bit more out of phase with the original wave, and the net result will be 
the destructive cancellation of the radiation by itself. 

If we had considered a real cavity, on the other hand, rather than 
an ideal one, the radiation would be diminished in amplitude at each suc- 
cessive reflection. Hence, even if it was a bit out of phase after the reflection 
which brings it back to the original position, it still could never interfere 
completely destructively. We expect then that the resonance width, the 
frequency range over which the cavity resonates, will depend on the absorp- 
tion of radiation by its walls. 

At the moment though let us stick with our perfect walls and find 
the solutions to Maxwell’s equations within the cavity subject to the same 
boundary conditions as before—the tangential components of the electric 
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field are zero at the walls. As before we can search for solutions which are 
products of separate terms, each depending on one coordinate only. We 
follow the same procedures as we followed in dealing with waveguides 
and obtain 


E,, = Eo, cos k,x sin ky sin kz e 
E, = Eo, sin k,x cos k,y sin k3z e (9-2-1) 


E, = Eo, sin k,x sin k,y cos k3z e 


with 
ae 
ke + kee + ka? = a = k? (9-2-2) 
and 
In mn nt 
k, = = k, = UB. k, => ae (9-2-3) 


where /, m, and 7 are integers. For convenience we can define the two vectors 
k and E, as 


k = (ky, ky, ks) (9-2-4) 
Fy = (Epox; Eoy, Eoz) (9-2-5) 
The requirement that V- E = 0 leads to the restriction on Ey given by 
k-E, = 0 (9-2-6) 
We find the magnetic field as before by taking V x E and integrating with 
. 1 6B 
respect to time (v x E= - oF a) 
a ae (cucu )asin kexcosk,y cos kyz e'* 
7) 
B, = aa (k x Eo), cos k,x sin ky cos k3z e' (9-2-7) 
(62) 
Bee oe (k x E,), cos k,x cos k,y sin k,z e' 
7) 


Now, since E, must lie at right angles to k [see Eq. (9-2-6) ], we can choose 
two standard vectors E,™ and E,'* and express any solution in terms of 
them. Again we define E,'¥ so as to have no z component. Thus 


ky Eo,7 + kyEo™ = 0 (9-2-8) 
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We define E,™ so as to lead to no z component of magnetic field. That is, 
er, = 0 
or 
k, Boy" — kgdg,"™ = 0 (9-2-9) 
We can see immediately that E)™ and E,™ are normal to each other: 


TE ™ TER ™M _ TE TM 
Eo, Eox + Eq, Loy = Hox Loy 


Since E),"® = 0, we have 


E,'!-Es™ = 0 (9-2-10) 


Incidentally our choice of the normal modes was somewhat arbitrary. We 
chose to give preference to the z axis, but we could just as easily have chosen 
the x or y axes. Obviously the normal modes obtained if we had used the 
x or y axes as our preferred direction can be expressed in terms of the normal 
modes we have just obtained. We see then that the system has two normal 
modes for each allowed frequency w and an infinite number of possible 
solutions for w. 

Again we will label the modes as TE,,,, and TM,,,,,, respectively. 
Now for any mode we have either 


kee, * + kobe = 0 or Kies, — Kaba = 0 


In either case Ey, and Ep, are in phase with each other because k, and 
k, are real numbers. In the case of a transverse magnetic mode we can 
write k,Eo,™ + k,Eoy™ + k3Eo,'" = 0, and hence E,™ is also in 
phase with E),™ and Ey,™. 

As a result we see that the electric and magnetic fields are 90° out of 
phase with respect to each other if the cavity is resonating in one of its 
modes. When the electric fields are at their maximum, the magnetic fields 
are zero, and vice versa. The energy stored in the fields thus is alternately 
electric and magnetic. To illustrate this, let us take the simplest mode 
corresponding to the lowest frequency. If we assume the dimensions a, }, 
and d are such that a < 6 < d, then the minimum frequency corresponds 
to/ = 0, m = 1, n = 1. This frequency, wo,,, is just given by 


= cull ZY +(e (9-2-11) 
011 b a oe 


Only the TE,,,; mode exists for this frequency. If we take the real parts of 
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E,, B,, B,, we have the following electromagnetic fields for this mode. 


et ae: 
E, = Eo, sin —y sin —z cos wt 


b d 
E, = E, =0 
omit (9-2-12) 
B, = — seer sin any cos —-z sin wt 
ie b d 
jee = Ee cos =) sin 6 sin wt 


Figure 9-4 shows the electric fields at time t = 0 and the magnetic fields 
which follow a quarter cycle later. As we can see the charge will oscillate 
from one side of the box to the other, just as in the case of a capacitor shunted 
by a perfect inductance. As we expect, the magnetic field reaches its maximum 
at the time of maximum current flow, when the electric field is zero. 

Other modes will, of course, correspond to different and more com- 
plicated charge distributions oscillating about the cavity. The principle 
is the same although the frequencies are higher. 


9-3 LOSS IN THE CAVITY WALLS; THE NOTION OF @ IN GENERAL 
AND AS APPLIED TO OUR CAVITY 


We begin by making some remarks about a resonant system in general. 
For any resonant system which is oscillating by itself, there is a given amount 
of energy loss per cycle which is often proportional to the energy stored. 
For example, in the case of our cavity, the energy loss per cycle is propor- 
tional to the average component of the Poynting vector normal to the cavity 
wall, which is in turn proportional to the energy density of the electro- 
magnetic field. A mass vibrating at the end of a spring and being acted 
upon by a resistive force proportional to its velocity has an amplitude which 


Fig. 9-4 The lowest mode TE ),, in 
cavity of dimensions a x b x d(a <b 
<c). Solid lines represent electric 
field. Dashed lines represent magnetic 
fields a quarter cycle later. 
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diminishes exponentially and hence loses energy at a rate proportional to 
its stored energy. For all systems of this sort we can define a parameter 
called the Q of the system as follows: 


energy stored in system 


a 9-3-1 
g energy lost per cycle ( ) 
If U is the energy stored in the system, we can write 
OY aid ey, (9-3-2) 
dat ~ 2nO 


where @, is the resonant frequency. Hence 
U = Une (olen) (9-3-3) 


If the energy stored is proportional to the square of the amplitude of oscil- 
lation, as is most often the case, then the amplitude will diminish as 
e~ (0/4"2" Thus, if A represents the coordinate which is oscillating (the 
electric field in the case of a cavity), we can write 


A = Ao en (wo/42Q)t + iwot (9-3-4) 


We can ask now for the frequencies which are present if we Fourier 
analyze this oscillation. We have then (assuming the oscillation to begin at 
b= 0) 


A(t) = — | ~ g(w)e'" dw 


JI 
Ao 
/ att Jo 
= — Ao 
~ \/2n[—a/4n@ + (wo — @)] 


The energy density per unit frequency interval is proportional to |g(w)|”, 
which turns out to be 


e- (@0/47Q + iwo — iw)t dt 


g(w) = 


(9-3-5) 


The 
2n[(@o — @)? + (@o/42Q)*] 
The full width at half-maximum of the resonance curve is just given by 
Mo 


Ao = = : (9-3-7) 


|g(@)|? = (9-3-6) 


This is more or less what we anticipated on the basis of the arguments we 
made earlier about reflection and absorption within our cavity. The larger Q 
is, the sharper our resonance. 

It remains for us to evaluate the Q of a cavity whose walls have con- 
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ductivity o. This is actually quite simple if we remember that the tangential 
components of both E and B are continuous as we cross into the metal. 
We remember also that B, (the tangential component of B at the surface) 
remains essentially unaltered as we reduce the conductivity from infinity 
to some finite value, as long as ./k/820 is much less than | [see page 243 
and in particular Eqs. (7-1-36) and (7-1-37)]. On the other hand, E,, which 
is zero for infinite conductivity, increases in proportion to ./k/no as we 
reduce the conductivity. 

To find an exact expression for E, in terms of B,, we can make use 
of Maxwell’s equation for conducting material and insert the appropriate 
time dependence. 


1 OE 
V x B= 4ncE + ro in (4x0 + ik)E (9-3-8) 


We have already solved the problem however if we just go back to 
Eqs. (7-1-36) and (7-1-37). Letting f be a unit normal vector pointing into 
the cavity wall, we can write 


eee Pr: ‘ 


To find the energy leaving the cavity per cycle we must evaluate the 
normal component of the Poynting vector along the cavity wall and average 
it over time. Referring back to Eq. (8-2-11), we have 


(S)iimeay = go Re(E x B’) (8-2-11) 


Applying this result to Eq. (9-3-9), we find that 


c k in 
CS: ftimeav = “gail a (|B, |? e'”’*) 


eee k 
~ 8x 2160 


Let us return to Eq. (9-2-7) and evaluate the above expression for one 
wall at a time. For convenience we make the following notational changes: 


|B, |? (9-3-10) 


= (k x Eo), 
Box ‘- k 
E 
a= aoe (9-3-1) 
Boz = (k x Eo). 


k 
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For the wall at x = 0 we have then 


Bo ay) 
jot 


B, = iBo, sin k,y cos k3z e 


iot 


ty 
| 


= iBy, cos ky sin k3z e 
and 
[By |? = |B, + |B,|? 
= Bo,” sin? ky cos? k3z + Bo,” cos” kay sin’ k3z (9-3-12) 
If we integrate the Poynting vector over the wall at x = 0, we find 
(=) at —_ (Boy’ + Bo,”)bd (9-3-13) 


dt through wall 320 
atx=0 


The total energy lost per unit time (averaged over time) is thus 


du Cc k 
( i. ~  16n [(Boy” + Boz”) bd + (Box” + Boy”) ab 


dt 6n\ 210 
all walls at (Be? a Ee-) ad| (9-3-14) 
The energy lost per cycle is then 
Au 1 


= — — = [4By,2(b + d) + bBo (a + d) 


cycle 8./ 2nok 


+ dBo,2(a + by] (9-3-15) 


To complete our calculation of the Q of our rectangular cavity, we must 
know the total energy u stored within the cavity. We can best evaluate this 
energy at the time when the electric fields are all zero and the magnetic 
fields are at their maximum. At that-moment we have 


B, = Bo, sin k,x cos ky cos k3z 
B, = Bo, cos k,x sin k2y cos k3z (9-3-16) 
B, = Bo, cos k,x cos k3y sin k3z 


The energy stored at that moment per unit volume is 
1 ‘a b ‘d 
u=——| dx | dy| dz(B,? + B,? + B,’) 
82 Jo 0 0 


= 64 (Box? a Boy” as Bo,”) (9-3-17) 
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Finally dividing Eq. (8-3-18) by Eq. (8-3-16), we have 


abd(Bo,? + Boy? + Boz’) 


920 = 
m0 46[aBo,7(b + d) + bBo,*(a + d) + dBy,*(a + b)] 


(9-3-18) 


where 6 = skin depth = 1/,/2zo0k. This simplifies considerably in the case 
of a cubical cavity where a = b = d = L. In that case 


18 
MO = — 3: 
es (9-3-19) 


The important thing to note is that 2xQ is approximately equal to 
the ratio of cavity volume to the volume corresponding to the skin depth. 
Over a wide range of frequencies this ratio increases as the square root 
of the frequency. Typical values of Q in the microwave region for silver 
are about 10%. 


PROBLEMS 


9-1. A long perfectly conducting waveguide of cross-sectional area 5 cm by 10 cm 
carries microwave energy in the dominant TE), mode. Keeping the maximum 
amplitude of electric field fixed, find and graph an expression for the trans- 
mitted power as a function of frequency. Does the asymptotic behavior as 
@ — oo make sense? 


9-2. Suppose we now cap off a section of the waveguide described in Prob. 9-1 by 
means of a plate having finite conductivity o. We introduce energy into the 
open end in the TE), mode at frequency w. What is the fraction of the incoming 
energy that is absorbed by the end plate? 


9-3. Suppose the entire waveguide described in Prob. 9-1 were made of copper. 
How would radiation of frequency w be attenuated in the TE), mode as a 
function of distance along the waveguide? Suppose the frequency w were 
sufficiently large so as to allow propagation in the TM,, mode. Would the 
attenuation be any different from that in the TE), mode at the same frequency? 


9-4. A rectangular cavity with perfectly conducting walls has dimensions of 2 cm by 
4 cm by 8 cm. 
(a) What is the lowest resonant frequency at which this cavity will oscillate? 
(b) Suppose the cavity is opened to air. Estimate the change in its resonant 
frequency. 
(c) What would the Q of the cavity become if its 2 cm by 4 cm faces were 
made of copper? 


9-5. A waveguide is made of two perfectly conducting coaxial cylinders with the 
radiation propagating in the space between them. Show that it is possible to 
have a mode in which both the electric and magnetic fields are perpendicular 
to the axis of the cylinder (transverse electric and magnetic mode—TEM). 
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Is there a cutoff frequency for this mode? What is the velocity of propagation 
of this mode? 


Z 


@ 


10 
Electric and Magnetic 
Susceptibility 


In earlier chapters we studied the effects of the macroscopic electric and 
magnetic properties of matter on the electric and magnetic field distribu- 
tions. In this chapter we will try to explain, at least in a simple-minded sort 
of way, how it is that these properties exist. We will revert to a microscopic 
description of matter and see how the polarizability of individual molecules 
leads to a reasonable understanding of electric and magnetic susceptibilities. 
Needless to say, we are not prepared at this point for the full rigors of 
quantum theory and statistical mechanics; we will find though that we can 
go a long way with “‘seat-of-the-pants” reasoning and come up with results 


which are not too unreasonable. 
321 
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Electric or magnetic polarization is the result of the average molecule 
having an electric or magnetic dipole moment in a given direction. This 
average moment can arise from two alternative sources or from a combina- 
tion of them. 


1. The molecule (or atom) can normally be free of any intrinsic moment. 
Applying an electric or magnetic field will however induce a moment. 
This, in general, will lead to a relatively small electric or magnetic 
susceptibility. 

2. The molecule may already have an electric or magnetic dipole moment. 
In this case the applied field will try to align this moment. Clearly the 
average polarization is temperature dependent because collisions be- 
tween molecules tend to disrupt the alignment. (It is also possible, 
as in the case of ferromagnetism, that a certain amount of long-range 
coherence exists among a fairly macroscopic group of molecules. The 
extent of this coherence will also be temperature dependent. Needless 
to say, coherence can lead to very large susceptibilities.) 


We will begin by discussing the electric susceptibility of material whose 
molecules have no intrinsic electric dipole moments. This treatment is 
relatively simple because we have no need to worry about the disrupting 
influence of collisions. We will then introduce some notions which derive 
from statistical mechanics and which will permit us to deal with a large 
number of interacting molecules at a given temperature 7. When we apply 
these notions to molecules having intrinsic electric dipole moments, we 
will be able to estimate the extent of their average polarization in the presence 
of an external electric field. The same types of treatment will be carried out 
for magnetism. 

Finally, we will say a few words about ferromagnetism and its origin 
in the closely coupled behavior of neighboring atoms. 


10-1 THE ELECTRIC POLARIZABILITY ‘OF NONPOLAR 
MOLECULES HAVING SPHERICAL SYMMETRY 


Naturally, we start with the simplest molecular system, an atom having 
complete spherical symmetry. We can think of the electrons’ charge as 
being distributed over a sphere of about 0.5 x 107% cm. The nucleus is of 
negligible size (~ 10~'* cm) and can be thought of, for our purposes, as a 
point charge at the center of the electron cloud. We would like to find out 
how much of a dipole moment is induced if an external field E, is applied 
(see Fig. 10-1). 

To first approximation the nucleus will move relative to the electrons 
to the point where the restoring force from the electron cloud just equals the 
applied field. At a distance r from the center of a uniform charge distribution 
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Fig. 10-1 An applied electric field causes the 
positive nucleus to be displaced relative to the 
center of the electron cloud, leading to a net 
electric dipole moment. 


of radius R and total charge Q, the field is given by (see page 33) 


r 
= S (10-1-1) 
In our case Q = — Ze where —e is the electron charge and Z is the atomic 
number. In order that E just precisely balance E,, we must have 
R3 
=—E ies 
r=3, Be (10-1-2) 
The dipole moment p of the atom is then 
R 
= Ze —E 
p e Ze a 
= RE, (10-1-3) 


In general, we can think of our atom as having an atomic polarizability « 
such that ping = &E,,,. In the case of the simple model we have taken above, 
the polarizability is just equal to R*. This result is not in terrible disagree- 
ment with what we would have obtained had we done a complete quantum- 
mechanical calculation. For example, in the case of atomic hydrogen, a 
turns out to be equal to $a)* where ap is the Bohr radius (a9 = 0.52 x 10° 
cm). 

Notice that the displacement of the nucleus is not very large for the 
sort of fields that are available in the laboratory. Typical high fields are in 
the hundreds of gauss (100 gauss = 30 kV/cm). Using our value of 0.5 x 
107° cm for R and letting Z = 1, we find, for a field of 100 gauss, that r = 
2.5 x 107 +4* cm. This is a very small displacement indeed, smaller in fact 
than the radius of the nucleus itself. 

On the whole we expect a certain amount of variation of atomic 
polarizability as we vary the details of the atomic structure. For example, 
we expect that noble gases with closed shells and no valence electrons might 
be harder to distort than alkali metals with 4 relatively free valence electron. 
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We also anticipate a modification of the elementary atomic polarizabilities 
when atoms are bound into molecules. These variations can be substantial 
and can run over several orders of magnitude. Hence precise measurements 
of polarizabilities can serve as important analytical tools in the study of 
chemical structures. 


10-2 THE RELATION BETWEEN ATOMIC POLARIZABILITY 
AND ELECTRIC SUSCEPTIBILITY 


Some time ago [see Eq. (2-6-22)] we defined the proportionality constant 
between electric field and dipole moment per unit volume to be the electric 
susceptibility y,. That is to say, P = y,E. One would naively guess that 
the ratio between the electric susceptibility y, and the atomic polarizability 
(which we will call «,) would just be the number of atoms per unit volume N. 
This is however not exactly the case; we must be careful to read the fine 
print in our definitions. When we defined polarizability, we related the 
induced dipole moment p to the applied electric field E,. That is to say, 
p = «,E,. On the other hand, the susceptibility equation relates the average 
dipole moment per unit volume at some point in the dielectric to the total 
field at that point. Since the applied field on the dipole does not include any 
part of the field caused by the dipole itself, we must be careful to remove that 
field when evaluating the atomic polarizability in terms of x,. 

The applied field E, at the dipole is just equal to the total field E 
minus the field due to the dipole itself. We must average everything over the 
volume taken up by the dipole, which we will call V. We have then 


{Ev = (Edy — <Extom>v (10-2-1) 


To carry out this averaging process rigorously, taking into account 
the complete charge distribution of the atom, is difficult. We will make the 
approximation that the distance between atoms is much larger than the 
atomic size. We will then let V be the average volume per dipole, or 1/N. 
The assumption that V is much larger than the atomic size permits us to 
make use of only the leading dipole term [Eq. (2-6-7)] to obtain the field 
due to the atom itself within V. We have then 


, 1 
CE stom) V = val Bon dV 


1 1 


where r’ is the location of the atomic dipole. 
We convert this to a surface integral, obtaining 


1 1 
E = — = es le 
Extom¥ = 7 = (> ¥ = =) dA (10-2-3) 
4 


of 
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If we now simplify matters by letting V be a sphere of radius b about the 
point r’, we find 


—] 
(Ertom?v . Ver on U ; oa ios 
of V 
—3np 
V 
(A little bit of thought will convince the reader that the shape of the volume 
V is irrelevent provided it is symmetrically situated about the position of 


the dipole.) 
Taking V = 1/N, we have our result 


CExtom>v = —30Np = —$nP (10-2-5) 
Finally, going back to Eq. (10-2-1), we can write 


(10-2-4) 


Je 
{Ey = a <E>y + 32P 


l 4n 
=(— +—]P 
G wie 
This yields the equation for x, in terms of «,: 


he = 1 — GnNa,/3 ee) 

In the limit where V becomes very small, y, reduces, as expected, to 
Na,. We might ask about the other limit, where N gets large. Is it really 
possible that y, becomes infinite for a particular value of N and then becomes 
negative for N greater than that value? We remember that «, is about equal 
to the volume of the atomic charge distribution [see Eq. (10-1-3)]. When 
we began our derivation for x,, we assumed that the volume of the charge 
distribution was very small compared with the average volume per dipole. 
That is, 

1 

te << > 
or 

Na, << 1 (10-2-7) 


Thus the domain of validity of Eq. (10-2-6) does not include the situation 
where 42Na,/3 is comparable with 1. 


10-3 POLARIZABILITY AS A SECOND-RANK TENSOR 


The most general linear relationship between the applied electric field E, 
and the polarization p of a molecule is that of a second-rank tensor with 
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components «,;. That is, 
3 z 
D; = >. 0,;E; (10-3-1) 
j= 

We will now demonstrate a rather important property of the tensor a, 
namely, its symmetry. The component a,; is the same as the component @;,. 
To see the physical meaning of this let us take our electric field, for example, 
in the y direction and of magnitude E,. This would give rise to a polarization 
component in the x direction equal to a,,£,. If instead we had taken the 
same magnitude of electric field along the x direction, we would have found 
a y component of polarization equal to a,,E£,. If «,, = 4,2, we could 
conclude that these two resultant components are equal. 

The demonstration of the symmetry of « rests upon the very simple 
observation that the electrostatic energy of a given charge distribution 
cannot depend upon the historical sequence of events by which it was built 
up. All that should matter when we determine the interaction energy of a 
given dipole p and the applied field E, are the magnitude of p and E, and 
the angle between them. Suppose we change E, slightly and allow p to 
change accordingly by an amount dp. The amount of work done on the 
charges in p by the field E, is just 


dW =E,:dp (10-3-2) 
Suppose now that we allow the z component of E, to be zero and let E,, 


and E,, be the x and y components, respectively. To arrive at these final 
values for the components of E,, we can proceed in one of two ways: 


1. Wecan first build up the x component of E, to its final value, allowing 
the y component of E, to remain zero. We then raise the y component 
of E, to its final value. 

2. We can reverse the order, first bringing up the y component of E, 
and then the x component of E,. 


Let us begin first with procedure 1 and evaluate the work done by 
E, on p as we bring it up by the indicated two-step process. In the first 
step we have 


Ea Eat 
W,(a) = | E,, 4p, = | @11E,, dE, 
0 0 
= $0,,E,,? 
In the second step of procedure 1 we have 
Wy(@) = 0 12Eq1Eg2 + 322 Eq2” 
The total work for procedure 1 is thus 
W(a) = 3051 Egy? + $022 E jo” + O22 a1 Ene (10-3-3) 


* 
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Completely analogously, procedure 2 yields 

WB) = Stake + 302 Eo + topBs Eo. (10-3-4) 
Letting W(b) = W(a), we conclude that 

Contes Oii2 


Obviously, the same proof works for any pair of indices i and j. Hence we 
generalize for all i and j 


Oo. = Oj 


(10-3-5) 


iy 

One very important consequence of this symmetry is the possibility 
of diagonalizing the polarizability tensor (see Sec. 1-7). We can find three 
mutually perpendicular directions in which p and E, are colinear. These 
three axes are three fundamental axes of symmetry for the system. The 
remarkable point is that three such axes of symmetry exist for the polariz- 
ability even though the charge distribution itself may be totally asymmetrical 
and show no axis of symmetry. 


10-4 THE POLARIZABILITY OF A POLAR MOLECULE 


Quite commonly the molecules out of which our dielectric is composed have 
a built-in electric dipole moment. If the dielectric is a liquid or a gas, then 
the molecules will be relatively free to rotate and will tend to line themselves 
up with their dipole moments lying along the direction of an applied electric 
field. Were it not for the fact that collisions between molecules are continually 
upsetting this alignment, one might expect the molecules to eventually all 
point along the field. Our chore then in this section is to understand, first, 
how large the permanent dipole moment of a typical polar molecule is 
likely to be and, second, to calculate, making use of some basic ideas from 
statistical mechanics, a value for the polarizability of the molecule as a 
function of temperature. 

Water is a very typical polar molecule. Referring to Fig. 10-2, we see 
that the water molecule is quite asymmetric. The two hydrogen atoms are 


Fig. 10-2 The water molecule has a net dipole 
moment because of the fact that the electrons 
from the hydrogen spend a part of their time 
“going around” the oxygen nucleus. 
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attached to the oxygen atom so as to have the OH axes make an angle of 
about 105° with one another. The electrons from the hydrogen atoms then 
spend a good part of their time around the oxygen atom, leading to a net 
dipole moment, as shown. We can estimate its size very approximately by 
noting that each electron is displaced by about a Bohr radius (a) = 0.5 
x 107% cm) on the average from its proton. This leads to a moment po 
given by 
Po & 2eap cos 52.5° 
= 2(5 x 107!° esu)(0.5 x 107® cm)(0.6) 
~3 x 107'? esucem 
In actual fact, the dipole moment of a water molecule is 
Po(water) = 1.84 x 10718 esu-cm (10-4-1) 


We will next make use of the Boltzmann distribution to determine 
the polarizability of the average polar molecule in an applied electric field 
E, at a temperature 7. We shall not go through the details of deriving the 
Boltzmann distribution equation or justifying it. We will only assert that 
statistically the dipoles will be distributed according to the distribution 
function e~”/*T where W is the dipole’s potential energy in the electric 
field and k is Boltzmann’s constant. That is, the probability of finding the 
dipole pointing into a given solid angle dQ will be proportional to e~”*T 
and dQ. We can thus write 


aN — —-W/kT 
7a Ae (10-4-2) 


where A is chosen to suitably normalize the distribution. If there are No 
dipoles altogether, we have 


No = A lena dQ (10-4-3) 
The potential energy of a dipole py in an applied field E, is just 


W = —po°E, = —poE, cos 8 (10-4-4) 
Substituting in Eq. (10-4-3), we obtain 


0 0 


2n n 
No =A | ao | ePoEacosG/kT sin § dO 


1 
= 2nA | ePoEax/kT oy 


= 


= gene = e’) : (10-4-5) 
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where 
= PoE. 
a= (10-4-6) 
Solving for A, we now have 
i eed 10-4-7 
~ On(e* — e-*) ee) 


To find the average polarization we note that the only component which is 
not averaged out is that along E,. If we let p be the component along E,, 
we have 


P = pocos 8 (10-4-8) 
and 


A 
= me |r. cos 8 e ¥/*T dQ 
0 5 


apa (: ie +) (10-4-9) 


The function on the right of Eq. (10-4-9), called the Langevin function 
L(a), can be expanded for small values of a: 


eee! 
e—e" a 
4a for small a (10-4-10) 


We have then our final result, relating f to E,, 


L(a) 


ie 


a 1 Do" E, 
= — —— 10-4-11 
b= 3 EF ( ) 


The polarizability a, is thus given by 
1 po” 
a 10-4-12 


It is interesting to evaluate this for the water molecule at room temperature 
(300°K). We have then 

1 (1.84)? x 107%° 

3 (1.4 x 107 1°)(300) 


2.7 <10572icme (10-4-13) 


a,(water at 300°K) = 
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Needless to say, this is an order of magnitude larger than the polarizability 
of a nonpolar molecule [see Eq. (10-1-3) ]. 


10-5 DIAMAGNETISM 


We are all quite familiar with the attractive force which the pole of a magnet 
exerts on a piece of iron, but very few of us realize that the same pole can 
repel a drop of water. This is because the forces related to diamagnetism, 
as displayed by the interaction of a magnetic field and water, are so weak 
on the scale we are accustomed to that we tend to ignore them. Nevertheless, 
diamagnetism represents an important example of the application of the 
laws of electrodynamics on the atomic scale and an excellent experimental 
tool for probing atomic structure. 

Diamagnetism is most clearly observed in those atomic systems which 
have no intrinsic magnetic dipole moments. In the event that such a moment 
does exist, the applied field will try to align it, giving rise to paramagnetism, 
a phenomenon which generally masks diamagnetism at room temperature 
and which will be discussed shortly. To treat diamagnetism properly really 
requires solving the quantum-mechanical Schrédinger equation in the 
presence of a magnetic field; nevertheless, we can go a long way toward 
understanding it by means of a crude classical model. At the very least we 
will determine the order of magnitude for the effects we anticipate and their 
dependence on atomic size. 

We will take as our simple model an electron with charge e traveling 
in a circular orbit of radius rg about a nucleus to which it is attracted by an 
electrostatic force Fy. The speed of the electron will be taken as vg; its 
angular momentum about the nucleus has magnitude Ly = mugro. 

Naturally the above atomic system has a magnetic moment py which 
is parallel and opposite in direction to Ly. Purely diamagnetic molecules 
can be produced by tying two of these atoms together in such a way that 
their angular momenta are forced to be in opposite directions. (This anti- 
correlation of the angular momenta .of electrons arises out of the Pauli 
exclusion principle, a rather profound rule which governs the quantum- 
mechanical behavior of all particles having half-integer spin.) Alternatively, 
we can produce a purely diamagnetic medium by heating it sufficiently 
so as to keep the angular momenta of the molecules randomly oriented in 
direction. In any case, we will assume in our treatment that there is no 
preferential direction along which angular momenta are aligned, but rather 
that they are uniformly and isotropically distributed. 

Incidentally, we recall from Eq. (4-10-1) that a simple relationship 
exists between po and Ly: 


eL, 


oo (10-5-1) 


Ho 
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Let us see what happens when we apply a magnetic field B in the same 
direction as Lp (see Fig. 10-3). In the process of bringing the field up to its 
final value, we will induce an electric field (Faraday’s law) which will on 
the average be opposite in direction to the velocity of the electron. This 
will cause the electron to speed up (because of its negative charge) and will 
increase the magnitude of its magnetic moment (opposite in direction to 
B). If we take a time At to bring the magnetic field up, then we expect an 
average component of electric field along the electron’s path given by 


Solving for E, we have 


a roB 
E=- 5. 
Teall (10-5-2) 
In this time interval, E changes the momentum of the electron by an amount 
= B 
eo) — = (10-5-3) 


Before we can evaluate the change in the magnetic moment, we must find 
if the radius of the orbit will be altered by the introduction of the magnetic 
field. Remembering that Fy is the centripetal force excited on the electron 
before the field is turned on, we have 


2 
jen ies (10-5-4) 
ro mro 
The additional centripetal force needed to keep the electron in the same 
orbit after the field has been turned on is just 
Ap eee (10-5-5) 
Mr 


But, using the result from Eq. (10-5-3) for Ap, we find 


ée 
AF = —— 1B (10-5-6) 
c 
Lo 
Ae eae jl na 
i / B ff Fig. 10-3 We apply a magnetic field B toa 
classical atom having magnetic moment py 
| as shown. This causes a change in magnetic 
Ho moment opposite in direction to B. 
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This is exactly the extra force which is supplied by the magnetic field B 
acting on the electron as it moves with velocity vg around the nucleus. 
Hence the radius ry remains unchanged, and the change in icine moment 
is given by 


eAL _ ero Ap m 
2mc—-.2mc 


ayo 


ee 10-5-7 
7 4mc? (10-5-7) 


In the above example, the applied field caused the (negative) electron to 
speed up, inducing an increase in the (negative) magnetic moment. If indeed 
we had taken L, opposite in direction to B, the electron would have started 
with its magnetic moment pointing along B. It would have slowed down 
upon application of the field, and, again, the change in magnetic moment 
would be opposite in direction to B. Indeed, retracing our steps, we see 
that Ap is again given, exactly as before, by Eq. (10-5-7). 

Suppose now that we had set our atom with its magnetic moment po 
at right angles to the applied field B. As we learned back in Chap. 4 the 
angular momentum and magnetic moment would precess about B as 
shown in Fig. 10-4. Looking along the direction of B, we would see the 
plane of the atom rotating clockwise. This clockwise rotation would look 
to us like an added magnetic moment pointing opposite in direction to B. 
The angular frequency associated with this precession is [see Eq. (4-10-2) } 


B 
pa (10-5-8) 


To find the magnetic moment Ay associated with w, we must first 
find R?, the mean squared distance of the electron from the axis of pre- 
cession. Remembering that the magnetic moment of a current loop is equal 
to the current times the area of the loop, we have 


|An| = — Li oR? (10-5-9) 


pad S 
@ Mii 


! 

H Fig. 10-4 The magnetic moment p precesses about 

\ the magnetic field B giving rise to an additional 

B \ component of magnetic moment opposite in direc- 
“2 tion to B. 
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A simple geometrical evaluation will demonstrate that 


—~ ir 
ro oe (10-5-10) 
Substituting into Eq. (10-5-9), we have finally 
e*ry7B 


Ap = (10-5-11) 


8mc? 


In fact we can combine the results expressed in Eqs. (10-5-7) and (10-5-11) 
into one equation which works for any orientation: 
e?R°B 


An = — 
o 4mc? 


(10-5-12) 


where again R* is the mean square distance of the electron from a line 
through the nucleus and parallel to B. 

It is illuminating to get some feeling as to the magnitude of Ap and 
compare it with the magnetic moment of the electron itself. If we take 
ro as 10° ® cm, we obtain 


|Au| = 3.5 x 10778 Besu-cm 


The electron’s magnetic moment, g eL/2mc [see Eq. (4-10-4)], can be 
calculated if we let L take on the quantized value of h/4x where h is Planck’s 
constant. For an electron, g = 2, and hence 


~ 9.2 x 1072! esu-cm. 


co 4nmce 


We note then that even in fields as high as 30 kilogauss the induced 
magnetic moment per atom is at least three orders of magnitude less than 
the magnetic moment of the electron itself. Small wonder that diamagnetic 
effects are relatively small and easily masked by paramagnetic effects, 
especially at low temperatures. 
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As we have mentioned earlier, paramagnetism results from the fact that 
an atomic or molecular system may have a residual magnetic moment after 
all energetically feasible “‘cancellations” among electrons have taken place. 
For example, if the molecule has an odd number of electrons, then there 
is no way in which the effect of the intrinsic magnetic moment of the last 
odd electron can be fully eliminated. Hence, just as when we were dealing 
with an electric field and polar molecules, there will be a predisposition for 
the residual magnetic moments to align themselves with the field. Again 
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the extent of alignment will depend on the temperature and will be calculable 
by means of the Boltzmann distribution function e ““. The only problem 
we have is what to use for W because, as we discovered in Chap. 4, the 
variation in the energy of the system as we rotate our dipole around will 
depend on just exactly what we are keeping constant—the current in the 
dipole, or the flux through the dipole, or perhaps something else. Indeed 
it would seem at first glance that we would have to understand the detailed 
dynamics of the atom itself to make some evaluation of W. 

Fortunately, this is not the case. As we learned when we studied 
diamagnetism, the change in the magnetic moment of our atom, even for 
a fairly large applied field, is small and can be ignored. Hence, as far as an 
outside observer is concerned, the dipole always has a torque equal to 
Hy x B, acting on it, where wo is the magnetic moment in the absence of 
field and B, is the applied field. This torque is such as to try to get wo and 
B, in the same direction. Thus the externally observable mechanical behavior 
of the magnetic dipole in a magnetic field is exactly the same as if we re- 
placed its magnetic moment by an equal electric dipole moment and the 
applied magnetic field by an equal applied electric field. We have then 


27 ee: (10-6-1) 


This leads us [see Eq. (10-4-9)] to a result for the average value of the 
component of # along B given by 


, a—— | 
i= Ho ( ee =) (10-6-2) 


q = 10-8 (10-6-3) 


For small values of a we can expand the Langevin function as before with 
the result 


(10-6-4) 


It is again instructive to have a look at the order of magnitude of this 
polarization and compare it with diamagnetism. We can evaluate ji/u) 
in terms of B, and T if we allow po to be equal to the electron’s intrinsic 
moment 


4 HoB, 
Ho 3 kT 


B 
~ 0.22 x 10-4 22 
3 T 
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For comparison. we find for the case of diamagnetism that 


ji e RB, . 
= = x = —0.38 x 1077 B, 
Ho/ diamag 4mc* Uo 


We see then that at room temperature paramagnetic effects are not 
orders of magnitude larger than diamagnetic effects. We might just determine 
the magnitude of the force on samples of a typical paramagnetic material 
at room temperature in a reasonable magnetic field gradient. Let us set up 
a magnetic field along the z direction of our coordinate system of the form 


B = B(z) = @ + xz)k (10-6-5) 


That is to say, B increases linearly with increasing z in our region of interest. 
Let us now place our sample, consisting of N paramagnetic atoms, at the 
origin of our coordinate system. It will develop a total magnetic moment of 


Naik = — ——— -6- 
k = 3 ‘iT (10-6-6) 
Making use of Eq. (4-7-20), we find the force on the sample: 
a 1 Nuy?Boxk 
b= Ni 3B = — No Bokk (10-6-7) 


0z 3 kT 
We take the following typical values (for 1 gram of sample): 
N = 4x10? 
By = 10,000 gauss 


ll 


k = 1,000 gauss/cm 


T = 300°K 


Inserting these values into Eq. (10-6-7), we find the magnitude of F to be 
about 175 dynes. This is not very much at all, not even 20 percent of the 
weight of the sample. Clearly then, paramagnetism alone is not responsible 
for the large forces on iron in the vicinity of a magnet. 

Before we go on to understand the origin of ferromagnetism, we 
should make one final observation related to paramagnetism. When we 
develop the quantum theory of atoms, we will discover that the possible 
orientations of ft with respect to B, are quantized, whereas in our derivation 
we assumed that any orientation was possible. This does not make a large 
qualitative difference to our result. For example, in the case of an atom 
having only a single electron spin contributing to its moment, there are 
two orientations possible, parallel and antiparallel to the applied field. 
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In this case the Langevin equation becomes 


Ho kT 


= Hoe for low field or high temperature (10-6-8) 
Comparison with Eq. (10-6-4) indicates only a factor of 3 difference in the 
low-field or high-temperature approximation. The agreement of quantum 
theory with classical theory improves as we deal with larger atomic angular 
momenta and a correspondingly larger number of possible orientations 
relative to the applied field. 

None of what we have done so far would give us any clue to the re- 
markable phenomena of ferromagnetism. If we take an iron atom all by 
itself, then we would find that 11 of its 26 electrons would have their spins 
lined up in one direction and 15 would have their spins lined up in the 
opposite direction. This would imply an excess of 4 Bohr magnetons of 
magnetic moment for the atom as a whole. (The unit of electron magnetic 
moment, eh/2mc, where h is Planck’s constant divided by 22 and m is 
the electron mass, is called the Bohr magneton.) The excess moment would 
actually occur in the next-to-last atomic shell which is not completely 
filled. Naturally we can calculate how much magnetic field this magnetic 
moment could lead to in its immediate vicinity and we could then estimate 
how much correlation should exist between neighboring atoms. It is not 
hard to estimate the magnitude of this effect; needless to say it is nowhere 
near what is needed to establish the high degree of correlation which exists 
between neighboring atoms in a crystal of iron or nickel. There must then 
be a new type of ‘“‘force” which is acting here to establish the strong tendency 
of neighboring atoms to have parallel spin. 

To really understand this force even qualitatively requires a study of 
quantum mechanics beyond the scope of this book. One of the fundamental 
rules governing the behavior of particles with intrinsic angular momentum 
equal to a half-integer times Planck’s constant divided by 22 [spin = 
(n/2)h] is the Pauli exclusion principle. This principle forbids two such 
particles from occupying identical quantum-mechanical states at exactly 
the same time. (That is why the two and only two electrons in the innermost 
shell of an atom have their spins pointing in opposite directions.) One 
result of this principle is the fact that two highly overlapping electron clouds 
tend to have opposite magnetic moments. Now two neighboring iron atoms 
in an iron crystal are bound together as a result of the sharing of the electrons 
in their outermost shells. These electrons in turn act on the two sets of 
aligned electrons in the unfilled inner shells and cause them to line up together 
It is this so-called exchange force between the two atoms that gives rise to 
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ferromagnetism. If we were to examine unmagnetized iron microscopically, 
we would find that it was made up of almost macroscopic domains, each 
with typical volume of about 107° mm?. Within a given domain the moments 
of the individual atoms all point in the same direction with an average 
moment per atom of about 2.2 Bohr magnetons. As we apply an external 
field to the iron the domain boundaries begin to move. Those domains whose 
moments are along the direction of the applied field grow at the expense 
of domains whose alignment is not as fortunate. Of course, the more the 
applied field, the more overall alignment we obtain until we reach the 
point of saturation. At this point, typically 15,000 gauss in iron, the leading 
domains have conquered all there is to conquer and no more magnetization 
can be induced. 

Needless to say, if we raise the temperature of our iron we reach the 
point where the high correlation between neighboring atoms is destroyed. 
Above this temperature, called the Curie point, iron is more or less a 
paramagnetic solid. 


PROBLEM 


10-1. Estimate the magnetic field which one iron atom produces at the site of its 
neighbor in an iron crystal. Compare the magnetic interaction energy of two 
neighboring iron atoms with AT for T = 300°K. 
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Tables 


CONVERSION TO “PRACTICAL” UNITS 


: Symbol Gaussian Practical 
Quantity ymbo ae ae Relation 
Distance K) cm meter 1 meter = 100 cm 
Mass m gram kg 1 kg = 1000 grams 
Time t sec sec 
Velocity v cm/sec meter/sec 1 meter/sec = 100 cm/sec 
Work, energy W erg joule 1 joule = 10’ ergs 
Charge q esu coulomb 1 coulomb = 2.998 x 10° esu 
Current I emu/sec ampere 1 ampere = 107 ' emu/sec 
(abampere) 
1 
Electric field E gauss volt/cm 1 volt/em = 5098 gauss 
| 
Electric 7) statvolt volt 1 volt = 3098 statvolt 
potential 
Magneticfield B gauss weber/m? 1 weber/m? = 10* gauss 
Resistance R statvolt/ ohm 1 ohm = . statvolt/abampere 


29.98 
abampere 
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FUNDAMENTAL CONSTANTS 


Quantity Symbol Value 
Speed of light in vacuum c 2.998 x 10!° cm/sec 
Fundamental charge e 4.803 x 107 '° esu 
Planck’s constant h 6.626 x 107?’ erg-sec 
Rest mass of electron m 0.911 x 10~?7 gram 
Rest mass of proton M, 1.672 x 10° ?* gram 
Avogadro’s number No 6.022 x 1073 mole™! 
Boltzmann’s constant k 1.38 x 107° erg/°K 
Electron volt eV 1.602 x 10° '* erg 
Rest energy of electron mc? 0.511 x 10° eV 
Rest energy of proton M,c? 0.938 x 10° eV 
Bohr radius a 0.529 x 10°8 cm 


SOME USEFUL VECTOR RELATIONS 


Viuv) = uVv + vVu 

V(A + B) = (A: V)B + (B- V)A + A x (V x B) + B x (V x A) 
V:-(uA) = uV-A+Vu-A 

V-(A x B)=B-(V x A) —A-(V x B) 

V x (uA) = u(V x A) + Vu xA 

V x (A x B) = (B- V)A — (A- V)B + A(V- B) — B(V- A) 
Vx(V x A)= VV-A)— V7A 

V x (Vu) = 0 

V-(V x A)=0 


[v-Fav= | w-nda 
v s 


[.vxrar= [ax ras 
v s 


[voar = | a0 a1 
Vv Ss 


Feal= | (x Fada 
(& KY 
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Ampere’s law, 148-149 
Antisymmetric tensor of fourth rank, 133 
Axial vector, 9 


Bessel functions, 77 
Biot-Savart law, 145-146 
Bohr magneton, 336 
Bohr radius, 323 
Boundary conditions: 
for electrostatic potential, 42—45 
for radiation field at interface between 
media, 245 
Brewster’s angle, 269 


Capacitance, 54-62 
Cavities: 
energy stored in, 318 


Cavities (cont'd): 
ideal, rectangular, 312-315 
with non-perfectly conducting walls, 315— 
319 
Q, 316, 319 
resonance curve, 316 
TE and TM modes, 313-314 
Charge: i 
electric, 11, 28 
magnetic, 11 
Charge density, 29 
as component of four-vector, 121~123 
magnetic, 134 
Circular polarization of radiation, 223 
Classical radius of electron, 224 
Coefficients of capacitance, 57 
Coherence and incoherence, 289-290 
Completeness, 224 
Conductivity, 140 
341 


342 


Conductors, 34-35, 38-39, 55-62, 140-141, 
235-250 
Coulomb’s law, 11, 28-30 
Curie point, 337 
Curl, 18-19, 67, 69 
Current density: 
electric, 121—123 
magnetic, 134-135 
Cyclotron frequency, 168 
Cylindrical coordinates, 62-67 


D’Alembertian, 126 
Determinant, 5 
Diamagnetism, 163, 330-333 
Dielectric constant, 50 
Dielectrics, 34 
Dipole moment: 

electric, 45-51 

magnetic, 153 
Dirac 6 function, 227 
Displacement current, 203-205 
Displacement field (D), 45-51 
Divergence, 17, 66, 69 


Earnshaw’s theorem, 43 
Eigenvalues of symmetric tensor, 26 
Eigenvectors of symmetric tensor, 26 
Electric charge, 11, 28 
Electric dipole moment, 45-51 
Electric field, definition, 29 
Electric polarizability : 
and electric susceptibility, 324-325 
of nonpolar molecules, 322-324 
of polar molecules, 327—333 
as a second-rank tensor, 325-327 
Electric susceptibility, 50, 324-325 
Electromagnetic units, 123 
Electromotive force, 187-189 
Electron radius (classical), 224 
Electrostatic units, 29 
Energy: 
of a static charge distribution, 51-54 
of a static current distribution, 162—168 
Energy conservation in electromagnetism, 
195-197 
Energy density of electromagnetic field, 
196 
Energy radiation: 
by accelerating charge, 221-222 
by electric dipole radiator, 279 


INDEX 


Energy radiation (cont'd): 
by electric quadrupole radiator, 285-286 
by magnetic dipole radiator, 282 
Exchange force, 336 


Faraday effect, 258-265 
Faraday’s law, 187-195 
Ferromagnetism, 153-154, 336-337 
Flip-coil magnetometer, 194 
Flux: 
of magnetic field through loop, 164 
of vector field, 14-17 
Force: 
on charged particle in electric or magnetic 
fields, 29, 131 
on current distribution, 200 
on current loop, 167—168 
on distribution of charge, 97-100 
Forward-scattering amplitude, 251, 256-258, 
298-300 
Four-vector, 115 
Fourier series and Fourier integral, 225—228 
Fourier transform, 228 
Frauenhofer diffraction: 
multiple slit, 300-301 
single slit, 293-297 
Fresnel’s equations, 265-268 


Gauss’ law, 30-32 

Gauss’ theorem, 17 

Gaussian units, 122 

Gradient, 13, 64, 69 

Green’s reciprocity theorem, 57-58 
Group, 5 


_ Group velocity, 256 


Gyromagnetic ratio (g): 
definition, 174 
measurement of g-2 for muon, 174-179 


H field, 155-161 
Huygen’s principle, 293 


Images, method of, 45 
Index of refraction (see Refractive index) 
Inductance: 
coefficients of, 164-165 
self-, 164 
of coaxial cylinders, 191 


INDEX 


Interaction energy of charge distributions, 94— 
96 

Interference arnorig set of dipole radiators, 
290-292 

Internal reflection, 269-270 


Kronecker 6, 23 


Langevin function, 329 
Laplace’s equation, 42 
in cartesian coordinates, 69-73 
in cylindrical coordinates, 73-78 
in spherical coordinates, 78-90 
Laplacian, 42 
Larmor frequency, 173 
Larmor precession, 172-174 
Larmor radiation formula, 222 
Legendre polynomials, 81-83 
Lienard-Wiechert poteniials, 214-220 
Lorentz force, 131 
Lorentz group, 115 
Lorentz transformation, 110ff 


Magnetic dipole moment, 153 
Magnetic field (B): 
on axis of circular current loop, 147 
determined through Biot-Savart law, 145- 
146 
as gradient of potential function, 149-152 
methods of calculating in absence of time 
dependence, 139ff 
required by Lorentz invariance, 123-126 
in solenoid, 147-148 
in vicinity of long cylindrical wire, 142~ 
143, 149 
Magnetic flux, 164 
Magnetic polarizability, 330-337 
Magnetization, 153-161! 
Mass (rest), 117 
Masses (rest) of some elementary particles, 
120 
Matrix, 5 
four-dimensional rotation, 115 
multiplication, 6 
three-dimensional rotation, 5—7 
transposition, 7 
Maxwell’s equations, derivation, 131-134 
Mean value theorem, 42 
Michelson-Morley experiment, 105-110 
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Momentum and energy conservation (relativ- 
istic), 118 
Momentum in eV/c, 119-120 
Momentum conservation in electrodynamics, 
197-200 
Momentum density of electromagnetic fields, 
200 
Momentum transformation under relativity, 
116-117 
Monopoles, magnetic, 134-137, 194-195 
Motion of charged particle: 
in constant magnetic field, 168-170 
in crossed electric and magnetic fields, 170— 
172 
Multipole expansion: 
of electrostatic potential, 90-94 
of radiation field: 
electric dipole radiation, 276-279 
electric quadrupole radiation, 281, 
282-287 
general introduction, 273-276 
magnetic dipole radiation, 280-282 


Neumann function, 77 
Nuclear shape determination, 96-97 


Ohm’s law, 140-141 
Optical theorem, 300 
Orthogonality, 224—225 


Paramagnetism, 153, 333-336 
Pauli exclusion principle, 330 
Permeability, magnetic, 156 
Phase velocity, 253-255 
Polar vector, 9 
Polarizability : 
electric (see Electric Polarizability) 
magnetic, 330-337 
Polarization: 
of dielectric (P), 45-51 
of radiation, 222-223 
Polarization charge, 49 
Potential : 
as component of a four-vector, 123-126 
in electrostatics, definition, 36-37 
time-dependent, retarded, 205-208, 213~ 
214 
uniqueness theorems in electrostatics, 42— 
45 


Ma 


Potential coefficients, 60 
Poynting vector: 
detinition, 195—196 
time averaged for radiation -field, 
278-279 
Pseudoscalar, 8 


Quadrupole moment: 
of charge distribution, 93, 284 
of nucleus, 97 
Quadrupole tensor of charge distribution, 
284-285 


Radiation: 
absorption and reflection by material, 
235ff 
scattering of: 
by free charge, 223-224 
by harmonically bound charge, 
229-231 
Radiation fields due to accelerating charge, 
219-220 
Refractive index: 
less than 1, 253-256 
physical origin, 250-253 
in terms of ¢ and p, 289 
in terms of forward-scattering amplitude, 
256-258 
Resistance, 141 
Rest mass, 117 
mechanical and electromagnetic compo- 
nents, 200—203 


Scalar, 7 

Skin depth, 244, 250, 319 

Spherical coordinates, 67-69 

Stokes’ theorem, 20 

Stress tensor, 23 
electromagnetic, 200 
electrostatic, 97-100 
magnetostatic, 179-183 


INDEX 


Susceptibility: 
electric, 50, 324 
magnetic, 156, 330-337 


Tensor of second rank: 
definition, 22 
diagonalizing a symmetric tensor, 24-26 
eigenvalues and eigenvectors of, 26 
electric and magnetic field as components 
of, 129-130 
in four dimensions, 116 
symmetric and antisymmetric, 24 
Thomson cross section, 223-224 
Torque on a current loop, 167 
Transverse nature of electromagnetic radia- 
tion, 220 


Vector potential: 
required by Lorentz invariance, 123-126 
time-dependent, 205-208 
used to find magnetic field, 141-145 
Vectors: 
curl, 18, 67, 69 
“‘del’’ operator, 13 
differentiation, 12-14 
divergence, 17, 66, 69 
elementary operations, 2—3 
in four dimensions, 115 
polar and axial, definition, 9 
transformation properties, 3-12 
Velocity transformation equations (relativis- 
tic), 116 
Verdet constant, 258, 262 
Virtual work, method of, 166 


Water as polar molecule, 328 

Waveguides: 
dominant mode, 310 
perfectly conducting, rectangular, 305-312 
phase and group velocity, 311 
TE and TM modes, 306 
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of extremely useful mathematical tool with wide applications. Volterra Equations, 
Fredholm Equations, much more. Advanced undergraduate to graduate level. 
Exercises. Bibliography. 238pp. 5% x 8%. 64828-1 Pa. $7.95 


CELESTIAL OBJECTS FOR COMMON TELESCOPES, T.W. Webb. Inesti- 
mable aid for locating and identifying nearly 4,000 celestial objects. 77 illustrations. 
645pp. 5% x 8. 20917-2, 20918-0 Pa., Two-vol. set $12.00 


MODERN NONLINEAR EQUATIONS, Thomas L. Saaty. Emphasizes practical 
solution of problems; covers seven types of equations. “, . . a welcome contribution 
to the existing literature. . . ..—Math Reviews. 490pp. 5% x 8%. 64232-1 Pa. $9.95 


FUNDAMENTALS OF ASTRODYNAMICS, Roger Bate et al. Modern approach 
developed by U.S. Air Force Academy. Designed as a first course. Problems, 
exercises. Numerous illustrations. 455pp. 5% x 8%. 60061-0 Pa. $8.95 


INTRODUCTION TO LINEAR ALGEBRA AND DIFFERENTIAL EQUA- 
TIONS, John W. Dettman. Excellent text covers complex numbers, determinants, 
orthonormal bases, Laplace transforms, much more. Exercises with solutions. 
Undergraduate level. 416pp. 5% x 8%. 65191-6 Pa. $9.95 


INCOMPRESSIBLE AERODYNAMICS, edited by Bryan Thwaites. Covers theo- 
retical and experimental treatment of the uniform flow of air and viscous fluids past 


two-dimensional aerofoils and three-dimensional wings; many other topics. 654pp. 
5% x 8%, 65465-6 Pa. $16.95 


INTRODUCTION TO DIFFERENCE EQUATIONS, Samuel Goldberg. Excep- 
tionally clear exposition of important discipline with applications to sociology, 
psychology, economics. Many illustrative examples; over 250 problems. 260pp. 
5% x Bie. 65084-7 Pa. $7.95 


LAMINAR BOUNDARY LAYERS, edited by L. Rosenhead. Engineering classic 
covers steady boundary layers in two- and three-dimensional flow, unsteady 


boundary layers, stability, observational techniques, much more. 708pp. 5% x 8%. 
65646-2 Pa. $15.95 


LECTURES ON CLASSICAL DIFFERENTIAL GEOMETRY, Second Edition, 
Dirk J. Struik. Excellent brief introduction covers curves, theory of surfaces, 
fundamental equations, geometry on a surface, conformal mapping, other topics. 
Problems. 240pp. 5% x 8%. 65609-8 Pa. $7.95 


CATALOG OF DOVER BOOKS 


ROTARY-WING AERODYNAMICS, W.Z. Stepniewski. Clear, concise text covers 

aerodynamic phenomena of the rotor and offers guidelines for helicopter per- 

formance evaluation. Originally prepared for NASA. 537 figures. 640pp. 6% x 9%. 
64647-5 Pa. $15.95 


DIFFERENTIAL GEOMETRY, Heinrich W. Guggenheimer. Local differential 
geometry as an application of advanced calculus and linear algebra. Curvature, 


transformation groups, surfaces, more. Exercises. 62 figures. 378pp. 5% x 8%. 
63433-7 Pa. $7.95 


INTRODUCTION TO SPACE DYNAMICS, William Tyrrell Thomson. Com- 
prehensive, classic introduction to space-flight engineering for advanced under- 
graduate and graduate students. Includes vector algebra, kinematics, transforma- 
tion of coordinates. Bibliography. Index. 352pp. 5% x 84. 65113-4 Pa. $8.95 


A SURVEY OF MINIMAL SURFACES, Robert Osserman. Up-to-date, in-depth 
discussion of the field for advanced students. Corrected and enlarged edition covers 


new developments. Includes numerous problems. 192pp. 5% x 8%. 
64998-9 Pa. $8.95 


ANALYTICAL MECHANICS OF GEARS, Earle Buckingham. Indispensable 
reference for modern gear manufacture covers conjugate gear-tooth action, gear- 
tooth profiles of various gears, many other topics. 263 figures. 102 tables. 546pp. 
5% X 8%, 65712-4 Pa. $11.95 


SET THEORY AND LOGIC, Robert R. Stoll. Lucid introduction to unified 
theory of mathematical concepts. Set theory and logic seen as tools for conceptual 
understanding of real number system. 496pp. 5% x 8%. 63829-4 Pa. $10.95 


A HISTORY OF MECHANICS, René Dugas. Monumental study of mechanical 
principles from antiquity to quantum mechanics. Contributions of ancient Greeks, 
Galileo, Leonardo, Kepler, Lagrange, many cthers. 671 pp. 5% x 8%. 

65632-2 Pa. $14.95 


FAMOUS PROBLEMS OF GEOMETRY AND HOW TO SOLVE THEM, 
Benjamin Bold. Squaring the circle, trisecting the angle, duplicating the cube: 
learn their history, why they are impossible to solve, then solve them yourself. 
128pp. 5% x 8%. 24297-8 Pa. $3.95 


MECHANICAL VIBRATIONS, J.P. Den Hartog. Classic textbook offers lucid 
explanations and illustrative models, applying theories of vibrations toa variety of 
practical industrial engineering problems. Numerous figures. 233 problems, 
solutions. Appendix. Index. Preface. 436pp. 5% x 84. 64785-4 Pa. $9.95 


CURVATURE AND HOMOLOGY, Samuel I. Goldberg. Thorough treatment of 
specialized branch of differential geometry. Covers Riemannian manifolds, topol- 
ogy of differentiable manifolds, compact Lie groups, other topics. Exercises. 315pp. 
5% 8%. 64314-X Pa. $8.95 


HISTORY OF STRENGTH OF MATERIALS, Stephen P. Timoshenko. Excel- 
lent historical survey of the strength of materials with many references to the 
theories of elasticity and structure. 245 figures. 452pp. 5% x 8%. 61187-6 Pa. $10.95 


CATALOG OF DOVER BOOKS 


GEOMETRY OF COMPLEX NUMBERS, Hans Schwerdtfeger. Illuminating, 
widely praised book on analytic geometry of circles, the Moebius transformation, 
and two-dimensional non-Euclidean geometries. 200pp. 5% x 84. 

63830-8 Pa. $6.95 


MECHANICS, J.P. Den Hartog. A classic introductory text or refresher. Hundreds 
of applications and design problems illuminate fundamentals of trusses, loaded 
beams and cables, etc. 334 answered problems. 462pp. 5% x 8% 60754-2 Pa. $8.95 


TOPOLOGY, John G. Hocking and Gail S. Young. Superb one-year course in 
classical topology. Topological spaces and functions, point-set topology, much 
more. Examples and problems. Bibliography. Index. 384pp. 5% x 84. 

65676-4 Pa. $8.95 


STRENGTH OF MATERIALS, J.P. Den Hartog. Full, clear treatment of basic 
material (tension, torsion, bending, etc.) plus advanced material on engineering 
methods, applications. 350 answered problems. 323pp. 5% x 8%. 60755-0 Pa. $8.95 


ELEMENTARY CONCEPTS OF TOPOLOGY, Paul Alexandroff. Elegant, 

intuitive approach to topology from set-theoretic topology to Betti groups; how 

concepts of topology are useful in math and physics. 25 figures. 57pp. 5% 8%. 
60747-X Pa. $3.50 


ADVANCED STRENGTH OF MATERIALS, J.P. Den Hartog. Superbly written 
advanced text covers torsion, rotating disks, membrane stresses in shells, much 
more. Many problems and answers. 388pp. 5% x 8%. 65407-9 Pa. $9.95 


COMPUTABILITY AND UNSOLVABILITY, Martin Davis. Classic graduate- 
level introduction to theory of computability, usually referred to as theory of 
recurrent functions. New preface and appendix. 288pp. 5% x 8%. 61471-9 Pa. $7.95 


GENERAL CHEMISTRY, Linus Pauling. Revised 3rd edition of classic first-year 
text by Nobel laureate. Atomic and molecular structure, quantum mechanics, 
statistical mechanics, thermodynamics correlated with descriptive chemistry. 
Problems. 992pp. 5% x 8%. 65622-5 Pa. $19.95 


AN INTRODUCTION TO MATRICES, SETS AND GROUPS FOR SCIENCE 
STUDENTS, G. Stephenson. Concise, readable text introduces sets, groups, and 
most importantly, matrices to undergraduate students of physics, chemistry, and 
engineering. Problems. 164pp. 5% x 8%. 65077-4 Pa. $6.95 


THE HISTORICAL BACKGROUND OF CHEMISTRY, Henry M. Leicester. 
Evolution of ideas, not individual biography. Concentrates on formulation of a 
coherent set of chemical laws. 260pp. 5% x 8%. : 61053-5 Pa. $6.95 


THE PHILOSOPHY OF MATHEMATICS: An Introductory Essay, Stephan 
Korner. Surveys the views of Plato, Aristotle, Leibniz & Kant concerning proposi- 
tions and theories of applied and pure mathematics. Introduction. Two appen- 
dices. Index. 198pp. 5% x 8. 25048-2 Pa. $6.95 


THE DEVELOPMENT OF MODERN CHEMISTRY, Aaron J. Ihde. Authorita- 
tive history of chemistry from ancient Greek theory to 20th-century innovation. 
Covers major chemists and their discoveries. 209 illustrations. 14 tables. Bibliog- 
raphies. Indices. Appendices. 851 pp. 5% x 8%. 64235-6 Pa. $17.95 
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DE RE METALLICA, Georgius Agricola. The famous Hoover translation of 
greatest treatise on technological chemistry, engineering, geology, mining of early 
modern times (1556). All 289 original woodcuts. 638pp. 6% x 11. 

60006-8 Pa. $17.95 


SOME THEORY OF SAMPLING, William Edwards Deming. Analysis of the 
problems, theory and design of sampling techniques for social scientists, industrial 
managers and others who find statistics increasingly important in their work. 61 
tables. 90 figures. xvii + 602pp. 5% x 8%. 64684-X Pa. $15.95 


THE VARIOUS AND INGENIOUS MACHINES OF AGOSTINO RAMELLI: A 
Classic Sixteenth-Century Illustrated Treatise on Technology, Agostino Ramelli. 
One of the most widely known and copied works on machinery in the 16th century. 
194 detailed plates of water pumps, grain mills, cranes, more. 608pp. 9 x 12. (EBE) 

25497-6 Clothbd. $34.95 


LINEAR PROGRAMMING AND ECONOMIC ANALYSIS, Robert Dorfman, 
Paul A. Samuelson and Robert M. Solow. First comprehensive treatment of linear 
programming in standard economic analysis. Game theory, modern welfare 
economics, Leontief input-output, more. 525pp. 5% x 8%. 65491-5 Pa. $13.95 


ELEMENTARY DECISION THEORY, Herman Chernoff and Lincoln E. Moses. 
Clear introduction to statistics and statistical theory covers data processing, 
probability and random variables, testing hypotheses, much more. Exercises. 
364pp. 5% x 8%. 65218-1 Pa. $9.95 


THE COMPLEAT STRATEGYST: Being a Primer on the Theory of Games of 
Strategy, J.D. Williams. Highly entertaining classic describes, with many illus- 
trated examples, how to select best strategies in conflict situations. Prefaces. 
Appendices. 268pp. 5% x 8%. 25101-2 Pa. $6.95 


MATHEMATICAL METHODS OF OPERATIONS RESEARCH, Thomas L. 
Saaty. Classic graduate-level text covers historical background, classical methods of 
forming models, optimization, game theory, probability, queueing theory, much 
more. Exercises. Bibliography. 448pp. 5% x 84. 65703-5 Pa. $12.95 


CONSTRUCTIONS AND COMBINATORIAL PROBLEMS IN DESIGN OF 
EXPERIMENTS, Damaraju Raghavarao. In-depth reference work examines 
orthogonal Latin squares, incomplete block designs, tactical configuration, partial 
geometry, much more. Abundant explanations, examples. 416pp. 5% x 84. 
65685-3 Pa. $10.95 


THE ABSOLUTE DIFFERENTIAL CALCULUS (CALCULUS OF TENSORS), 
Tullio Levi-Civita. Great 20th-century mathematician’s classic work on material 
necessary for mathematical grasp of theory of relativity. 452pp. 5% x 8%. 

63401-9 Pa. $9.95 


VECTOR AND TENSOR ANALYSIS WITH APPLICATIONS, A.I. Borisenko 
and I.E. Tarapov. Concise introduction. Worked-out problems, solutions, exer- 
cises. 257pp. 5% x 84. 63833-2 Pa. $6.95 
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THE FOUR-COLOR PROBLEM: Assaults and Conquest, Thomas L. Saaty and 
Paul G. Kainen. Engrossing, comprehensive account of the century-old combina- 
torial topological problem, its history and solution. Bibliographies. Index. 110 
figures. 228pp. 5% x 8%. 65092-8 Pa. $6.95 


CATALYSIS IN CHEMISTRY AND ENZYMOLOGY, William P. Jencks. 
Exceptionally clear coverage of mechanisms for catalysis, forces in aqueous 
solution, carbonyl- and acyl-group reactions, practical kinetics, more. 864pp. 
5% x BY. 65460-5 Pa. $19.95 


PROBABILITY: An Introduction, Samuel Goldberg. Excellent basic text covers set 
theory, probability theory for finite sample spaces, binomial theorem, much more. 
360 problems. Bibliographies. 322pp. 5% x 8%. 65252-1 Pa. $8.95 


LIGHTNING, Martin A. Uman. Revised, updated edition of classic work on the 
physics of lightning. Phenomena, terminology, measurement, photography, 
spectroscopy, thunder, more. Reviews recent research. Bibliography. Indices. 
320pp. 5% x 8%. 64575-4 Pa. $8.95 


PROBABILITY THEORY: A Concise Course, Y.A. Rozanov. Highly readable, 
self-contained introduction covers combination of events, dependent events, 
Bernoulli trials, etc. Translation by Richard Silverman. 148pp. 5% x 8%. 

63544-9 Pa. $5.95 


THE CEASELESS WIND: An Introduction to the Theory of Atmospheric Motion, 
John A. Dutton. Acclaimed text integrates disciplines of mathematics and physics 
for full understanding of dynamics of atmospheric motion. Over 400 problems. 
Index. 97 illustrations. 640pp. 6 x 9. 65096-0 Pa. $17.95 


STATISTICS MANUAL, Edwin L. Crow, et al. Comprehensive, practical 
collection of classical and modern methods prepared by U.S. Naval Ordnance Test 


Station. Stress on use. Basics of statistics assumed. 288pp. 5% x 8%. 
60599-X Pa. $6.95 


DICTIONARY/OUTLINE OF BASIC STATISTICS, John E. Freund and Frank 
J. Williams. A clear concise dictionary of over 1,000 statistical terms and an outline 
of statistical formulas covering probability, nonparametric tests, much more. 
208pp. 5% x 8%. 66796-0 Pa. $6.95 


STATISTICAL METHOD FROM THE VIEWPOINT OF QUALITY CON- 
TROL, Walter A. Shewhart. Important text explains regulation of variables, uses 


of statistical control to achieve quality control in industry, agriculture, other areas. 
192pp. 5% x 84. 65232-7 Pa. $7.95 


THE INTERPRETATION OF GEOLOGICAL PHASE DIAGRAMS, Ernest G. 
Ehlers. Clear, concise text emphasizes diagrams of systems under fluid or 
containing pressure; also coverage of complex binary systems, hydrothermal 
melting, more. 288pp. 6% x 9%. 65389-7 Pa. $10.95 


STATISTICAL ADJUSTMENT OF DATA, W. Edwards Deming. Introduction to 
basic concepts of statistics, curve fitting, least squares solution, conditions without 


parameter, conditions containing parameters. 26 exercises worked out. 271pp. 
5% x 84. 64685-8 Pa. $8.95 
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TENSOR CALCULUS, J.L. Synge and A. Schild. Widely used introductory text 
covers spaces and tensors, basic operations in Riemannian space, non-Riemannian 
spaces, etc. 324pp. 5% x 8%. 63612-7 Pa. $7.95 


A CONCISE HISTORY OF MATHEMATICS, Dirk J. Struik. The best brief 
history of mathematics. Stresses origins and covers every major figure from ancient 
Near East to 19th century. 41 illustrations. 195pp. 5% x 8%. 60255-9 Pa. $7.95 


A SHORT ACCOUNT OF THE HISTORY OF MATHEMATICS, W.W. Rouse 
Ball. One of clearest, most authoritative surveys from the Egyptians and Phoeni- 
cians through 19th-century figures such as Grassman, Galois, Riemann. Fourth 
edition. 522pp. 5% x 8%. 20630-0 Pa. $10.95 


HISTORY OF MATHEMATICS, David E. Smith. Nontechnical survey from 
ancient Greece and Orient to late 19th century; evolution of arithmetic, geometry, 
trigonometry, calculating devices, algebra, the calculus. 362 illustrations. 1,355pp. 
5% x 84. 20429-4, 20430-8 Pa., Two-vol. set $23.90 


THE GEOMETRY OF RENE DESCARTES, René Descartes. The great work 
founded analytical geometry. Original French text, Descartes’ own diagrams, 
together with definitive Smith-Latham translation. 244pp. 5% x 8%. 

60068-8 Pa. $6.95 


THE ORIGINS OF THE INFINITESIMAL CALCULUS, Margaret E. Baron. 
Only fully detailed and documented account of crucial discipline: origins; 
development by Galileo, Kepler, Cavalieri; contributions of Newton, Leibniz, 
more. 304pp. 5% x 8). (Available in U.S. and Canada only) 65371-4 Pa. $9.95 


THE HISTORY OF THE CALCULUS AND ITS CONCEPTUAL DEVELOP- 

MENT, Carl B. Boyer. Origins in antiquity, medieval contributions, work of 

Newton, Leibniz, rigorous formulation. Treatment is verbal. 346pp. 5% x 8, 
60509-4 Pa. $7.95 


THE THIRTEEN BOOKS OF EUCLID’S ELEMENTS, translated with introduc- 
tion and commentary by Sir Thomas L. Heath. Definitive edition. Textual and 
linguistic notes, mathematical analysis. 2,500 years of critical commentary. Not 
abridged. 1,414pp. 5% x 8%. 60088-2, 60089-0, 60090-4 Pa., Three-vol. set $29.85 


GAMES AND DECISIONS: Introduction and Critical Survey, R. Duncan Luce 
and Howard Raiffa. Superb nontechnical introduction to game theory, primarily 
applied to social sciences. Utility theory, zero-sum games, n-person games, 
decision-making, much more. Bibliography. 509pp. 5% = 8%  65943-7 Pa. $11.95 


THE HISTORICAL ROOTS OF ELEMENTARY MATHEMATICS, Lucas 
N.H. Bunt, Phillip S. Jones, and Jack D. Bedient. Fundamental underpinnings of 
modern arithmetic, algebra, geometry and number systems derived from ancient 
civilizations. 320pp. 5% x 8%. 25563-8 Pa. $8.95 


CALCULUS REFRESHER FOR TECHNICAL. PEOPLE, A. Albert Klaf. Covers 
important aspects of integral and differential calculus via 756 questions. 566 
problems, most answered. 431 pp. 5% x 8%. 20370-0 Pa. $8.95 


CATALOG OF DOVER BOOKS 


CHALLENGING MATHEMATICAL PROBLEMS WITH ELEMENTARY 
SOLUTIONS, A.M. Yaglom and I.M. Yaglom. Over 170 challenging problems on 
probability theory, combinatorial analysis, points and lines, topology, convex 
polygons, many other topics. Solutions. Total of 445pp. 5% x 8%. Two-vol. set. 
Vol. I 65536-9 Pa. $6.95 
Vol. II 65537-7 Pa. $6.95 


FIFTY CHALLENGING PROBLEMS IN PROBABILITY WITH SOLU- 
TIONS, Frederick Mosteller. Remarkable puzzlers, graded in difficulty, illustrate 
elementary and advanced aspects of probability. Detailed solutions. 88pp. 5% x 8%. 

65355-2 Pa. $4.95 


EXPERIMENTS IN TOPOLOGY, Stephen Barr. Classic, lively explanation of 
one of the byways of mathematics. Klein bottles, Moebius strips, projective planes, 
map coloring, problem of the Koenigsberg bridges, much more, described with 
clarity and wit. 43 figures. 210pp. 5% x 8%. 25933-1 Pa. $5.95 


RELATIVITY IN ILLUSTRATIONS, Jacob T. Schwartz. Clear nontechnical 
treatment makes relativity more accessible than ever before. Over 60 drawings 
illustrate concepts more clearly than text alone. Only high school geometry needed. 
Bibliography. 128pp. 6% x 9%. 25965-X Pa. $6.95 


AN INTRODUCTION TO ORDINARY DIFFERENTIAL EQUATIONS, Earl 
A. Coddington. A thorough and systematic first course in elementary differential 
equations for undergraduates in mathematics and science, with many exercises and 
problems (with answers). Index. 304pp. 5% x 8%. 65942-9 Pa. $8.95 


FOURIER SERIES AND ORTHOGONAL FUNCTIONS, Harry F. Davis. An 
incisive text combining theory and practical example to introduce Fourier series, 
orthogonal functions and applications of the Fourier method to boundary-value 
problems. 570 exercises. Answers and notes. 416pp. 5% x 8%. 65973-9 Pa. $9.95 


THE THEORY OF BRANCHING PROCESSES, Theodore E. Harris. First 
systematic, comprehensive treatment of branching (i.e. multiplicative) processes 
and their applications. Galton-Watson model, Markov branching processes, 
electron-photon cascade, many other topics. Rigorous proofs. Bibliography. 
240pp. 5% x 8%. 65952-6 Pa. $6.95 


AN INTRODUCTION TO ALGEBRAIC STRUCTURES, Joseph Landin. 
Superb self-contained text covers “abstract algebra’’: sets and numbers, theory of 
groups, theory of rings, much more. Numerous well-chosen examples, exercises. 
247pp. 5% x 8%. 65940-2 Pa. $6.95 


Prices subject to change without notice. 
Available at your book dealer or write for free Mathematics and Science Catalog to Dept. GI, 
Dover Publications, Inc., 31 East 2nd St., Mineola, N.Y. 11501, Dover publishes more than 175 
books each year on science, elementary and advanced mathematics, biology, music, art, 
literature, history, social sciences and other areas. 
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PRINCIPLES OF 
ELECTRODYNAMICS ~ 


Melvin Schwartz 


Unlike most textbooks on electromagnetic theory, which treat electricity, mag- 
netism,.Coulomb’s law and Faraday’s law as almost independent subjects within 
the framework of the theory, this well-written text takes a relativistic point of view 
in which electric and magnetic fields are really different aspects of the same 
physical quantity. 


Suitable for advanced undergraduates and graduate students, this volume offers a 
superb exposition of the essential unity of electromagnetism in its natural, 
relativistic framework while demonstrating the powerful constraint of relativistic 
invariance. It will be seen that all electromagnetism follows from electrostatics and 
from the requirement for the simplest laws allowable under the relativistic 
constraint. By means of these insights, the author hopes to encourage students to 
think about theories as yet undeveloped and to see this model as useful in other 
areas of physics. 


After an introductory chapter establishing the mathematical background of the 
subject and a survey of some new mathematical ideas, the author reviews the 
principles of electrostatics. He then introduces Einstein’s special theory of relativity 
and applies it throughout the rest of the book. Topics treated range from Gauss’s 
theorem, Coulomb’s law, the Faraday effect and Fresnel’s equations to multiple 
expansion of the radiation field, interference and diffraction, waveguides and 
cavities and electric and magnetic susceptibility. 


Carefully selected problems at the end of each chapter invite readers to test their 
grasp of the material. 


Professor Schwartz received his Ph.D. from Columbia University and has taught 
physics there and at Stanford University. He is perhaps best known for his 
experimental research in the field of high-energy physics and was a co-discoverer 
of the muon-type neutrino in 1962. 


Unabridged Dover (1987) republication of the edition published by the McGraw- 


Hill Book Company, New York, 1972 (International Series in Pure and Applies 
Physics). 112 illustrations. Preface. Index. 352pp. 5% x 8%. Paperbound. 


ISBN O0-44b-b5453-1 
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